light770/qwen3-reranker-0.6b

Compact Powerhouse for Vector Reranking

Public
7 runs

Qwen3-Reranker-0.6B is a lightweight yet powerful cross-encoder reranking model from Alibaba’s Qwen team, designed to boost retrieval accuracy in two-stage RAG pipelines. It scores query-document pairs with high precision, perfect for re-ranking top-k results from vector search.

-Cross-Encoder Architecture: Processes query and document together for superior relevance scoring vs. bi-encoders

-Long Context: 32K token context window handles long documents without truncation

-Instruction-Aware: Task-specific instructions improve ranking accuracy by 1–5%

-Multilingual: Supports 100+ languages including code, enabling cross-lingual reranking

Specification Value
Parameters 0.6B (600M)
Architecture Dense Transformer decoder (Causal LM)
Layers 28
Context Length 32,768 tokens
Scoring Method Yes/No token logits
Languages 100+
License Apache 2.0
Release Date June 2025
Model created