Re-ranking
Re-ranking is a retrieval technique that uses a more powerful model to reorder an initial set of search results by relevance, significantly improving the quality of the final results presented to the user or AI model.
What is Re-ranking?
Why Re-ranking Matters for Business
Related Terms
Explore further
FAQ
Frequently asked questions
Typically 20-100 candidates are re-ranked. Fewer candidates may miss relevant results; more adds latency without significant quality improvement. The optimal number depends on the diversity of your content and the quality of initial retrieval.
Cross-encoder re-ranking typically adds 50-200 milliseconds per query, depending on the number of candidates and model size. This is usually acceptable for interactive applications and negligible for batch processing.
Yes. LLMs can be used as re-rankers by asking them to score or compare result relevance. This can be effective but is slower and more expensive than dedicated re-ranking models. Purpose-built re-rankers typically offer a better speed-quality trade-off.
Need help implementing this?
Our team can help you apply these concepts to your business. Book a free strategy call.