Similarity ranking - ML & AI Coding

54. Similarity ranking

easy

General

senior

Build a similarity-based ranking function for an embedding retrieval system: rank candidate items by how close their embedding vectors are to a query embedding. Score each candidate x_i against the query q with cosine similarity:

\text{cos\_sim}(q, x_i) = \frac{q \cdot x_i}{\|q\|_2 \, \|x_i\|_2}
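
As a quick sanity check of the formula, here is a small worked case. The two vectors are chosen only for illustration and do not come from the original problem:

q = (1, 0), \quad x_1 = (1, 1)

q \cdot x_1 = 1 \cdot 1 + 0 \cdot 1 = 1, \qquad \|q\|_2 = 1, \qquad \|x_1\|_2 = \sqrt{1^2 + 1^2} = \sqrt{2}

\text{cos\_sim}(q, x_1) = \frac{1}{1 \cdot \sqrt{2}} \approx 0.7071

A candidate pointing in exactly the same direction as q scores 1, an orthogonal one scores 0, and one pointing the opposite way scores -1.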

Requirements

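Implement a function that takes query, top_k, and candidates (see Input Signature below) and returns the indices of the best-matching candidates.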

Rules:

Compute cosine similarity between query and each vector in candidates.
Return the indices of the top_k most similar candidates, sorted from most to least similar.
Break ties by smaller index first (stable ranking).
Don’t use any prebuilt similarity or nearest-neighbor utilities (e.g., sklearn, scipy).
Keep it in a single Python function using only NumPy + built-ins.
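
The rules above can be sketched as one function. This is a minimal sketch and not a reference solution: the name `rank_by_cosine` is hypothetical, the argument order is assumed from the Input Signature table, `query` is assumed to have shape (d,) and `candidates` shape (n, d), and giving zero-norm vectors a similarity of 0 is an assumption the problem statement does not cover.

```python
import numpy as np


def rank_by_cosine(query, top_k, candidates):
    """Return indices of the top_k candidates most similar to query.

    Hypothetical name; assumes query has shape (d,) and candidates (n, d).
    """
    query = np.asarray(query, dtype=float)
    candidates = np.asarray(candidates, dtype=float)

    # Dot product of every candidate row with the query, shape (n,).
    dots = candidates @ query

    # Product of L2 norms: per-row candidate norms times the scalar query norm.
    denom = np.linalg.norm(candidates, axis=1) * np.linalg.norm(query)

    # Assumption: a zero-norm query or candidate gets similarity 0
    # instead of triggering a division by zero.
    sims = np.divide(dots, denom, out=np.zeros_like(dots), where=denom > 0)

    # Stable sort on negated scores: descending similarity,
    # ties keep the smaller index first.
    order = np.argsort(-sims, kind="stable")
    return order[:top_k]


# Illustrative inputs, not taken from the original problem.
demo_candidates = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 0.0]])
demo_result = rank_by_cosine(np.array([1.0, 0.0]), 3, demo_candidates)
print(demo_result)  # [0 3 2]
```

Candidates 0 and 3 both score exactly 1.0 because cosine similarity ignores magnitude, so the tie rule puts index 0 ahead of index 3. If top_k exceeds the number of candidates, slicing simply returns all indices.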

Example

python

Output:

python

Input Signature

Argument	Type
query	np.ndarray
top_k	int
candidates	np.ndarray

Output Signature

Return Name	Type
value	np.ndarray

Constraints

Input query and candidates are NumPy arrays.
top_k is a positive integer.
Return indices as a NumPy array.

Hint 1

Use matrix multiplication candidates @ query to compute dot products for all items at once.

Hint 2

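To normalize, divide the dot-product vector by the product of the L2 norms: the query's norm (a scalar) times the per-row norms of candidates (a vector with one entry per candidate). NumPy broadcasts the scalar across that vector.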
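
The first two hints can be illustrated together. The sketch below uses made-up vectors and assumes `candidates` has shape (n, d) and `query` shape (d,); the variable names are illustrative only.

```python
import numpy as np

# Illustrative data: three 2-D candidates and one 2-D query.
query = np.array([3.0, 4.0])
candidates = np.array([[3.0, 4.0], [4.0, 3.0], [-3.0, -4.0]])

# One matrix-vector product gives all dot products at once, shape (3,).
dots = candidates @ query

# Per-row L2 norms of the candidates (shape (3,)) and the scalar query norm.
cand_norms = np.linalg.norm(candidates, axis=1)
query_norm = np.linalg.norm(query)

# The scalar query_norm broadcasts across the vector of candidate norms.
sims = dots / (cand_norms * query_norm)
print(sims)
```

Every vector here has norm 5, so the similarities come out as 25/25, 24/25, and -25/25.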

Hint 3

Use np.argsort(-sims, kind='stable') to sort indices by descending similarity while preserving original order for ties.
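
To see why the stable sort on negated scores matters for tie-breaking, here is a small sketch with hand-picked similarity values (illustrative only):

```python
import numpy as np

# Indices 1 and 3 tie at 0.9; indices 0 and 2 tie at 0.5.
sims = np.array([0.5, 0.9, 0.5, 0.9, 0.1])

# Negate so ascending order means descending similarity;
# kind="stable" keeps tied entries in their original index order.
order = np.argsort(-sims, kind="stable")
top_3 = order[:3]
print(top_3)  # [1 3 0]

# Reversing an ascending sort flips the tie order instead.
reversed_order = np.argsort(sims, kind="stable")[::-1]
print(reversed_order[:3])  # [3 1 2]
```

The reversed variant returns the larger index first among ties, which would violate the tie-breaking rule.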

Roles

ML Engineer

AI Engineer

Companies

General

Levels

senior

entry
