Readme
Qwen3-Reranker-0.6B is a lightweight yet powerful cross-encoder reranking model from Alibaba’s Qwen team, designed to boost retrieval accuracy in two-stage RAG pipelines. It scores query-document pairs with high precision, perfect for re-ranking top-k results from vector search.
-Cross-Encoder Architecture: Processes query and document together for superior relevance scoring vs. bi-encoders
-Long Context: 32K token context window handles long documents without truncation
-Instruction-Aware: Task-specific instructions improve ranking accuracy by 1–5%
-Multilingual: Supports 100+ languages including code, enabling cross-lingual reranking
| Specification | Value |
|---|---|
| Parameters | 0.6B (600M) |
| Architecture | Dense Transformer decoder (Causal LM) |
| Layers | 28 |
| Context Length | 32,768 tokens |
| Scoring Method | Yes/No token logits |
| Languages | 100+ |
| License | Apache 2.0 |
| Release Date | June 2025 |