Q&A – Ask about my profile
Ask any question about 2Z1T Conseil's skills, experience or availability.
Under the hood
The answer is not generated out of thin air. Each question triggers a search through my profile (BM25 keyword matching combined with vector similarity) to extract the most relevant passages, which are then fed to the model.
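A minimal sketch of that hybrid retrieval step, assuming a whitespace tokenizer, bag-of-words vectors as a stand-in for real embeddings, and reciprocal rank fusion to combine the two rankings (the sample passages and all function names are illustrative, not the actual pipeline):

```python
import math
from collections import Counter

def tokenize(text):
    return text.lower().split()

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Classic BM25 over pre-tokenized documents."""
    N = len(docs)
    avgdl = sum(len(d) for d in docs) / N
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in query:
            if tf[t] == 0:
                continue
            idf = math.log(1 + (N - df[t] + 0.5) / (df[t] + 0.5))
            s += idf * tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores

def vector_scores(query, docs, vocab):
    """Cosine similarity over bag-of-words vectors (toy stand-in for embeddings)."""
    def embed(tokens):
        c = Counter(tokens)
        return [c[t] for t in vocab]
    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        nu = math.sqrt(sum(a * a for a in u))
        nv = math.sqrt(sum(x * x for x in v))
        return dot / (nu * nv) if nu and nv else 0.0
    q = embed(query)
    return [cosine(q, embed(d)) for d in docs]

def hybrid_rank(question, passages, k=60):
    """Fuse both rankings with reciprocal rank fusion (RRF)."""
    docs = [tokenize(p) for p in passages]
    query = tokenize(question)
    vocab = sorted({t for d in docs for t in d})
    fused = [0.0] * len(passages)
    for scores in (bm25_scores(query, docs), vector_scores(query, docs, vocab)):
        ranked = sorted(range(len(passages)), key=lambda i: -scores[i])
        for rank, i in enumerate(ranked):
            fused[i] += 1.0 / (k + rank + 1)
    return sorted(range(len(passages)), key=lambda i: -fused[i])

# Hypothetical profile passages, for illustration only.
passages = [
    "Led test automation for a large e-commerce platform.",
    "Available for consulting missions from next quarter.",
    "Built CI pipelines and performance test suites.",
]
order = hybrid_rank("test automation experience", passages)
print(order[0])  # index of the best-matching passage
```

The top-ranked passages (not the whole profile) are then placed in the model's context, which keeps prompts short enough for a small local model.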
The LLM used is Qwen2.5 1.5B (~1 GB). It was selected from several candidates based on a measured relevance score.
Model selection relied on a custom automated evaluation framework built specifically for this use case: a corpus of reference questions, execution of each candidate model, and relevance scoring of their answers. Test, measure, decide – that is precisely my domain.
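The evaluation loop can be sketched like this; the token-overlap F1 metric, the sample corpus, and the lambda "models" are simplified assumptions standing in for real local-model calls and the actual relevance score:

```python
def relevance(answer, reference):
    """Token-overlap F1 between a model answer and the reference answer
    (a simple stand-in for the real relevance metric)."""
    a, r = set(answer.lower().split()), set(reference.lower().split())
    if not a or not r:
        return 0.0
    overlap = len(a & r)
    p, rec = overlap / len(a), overlap / len(r)
    return 2 * p * rec / (p + rec) if p + rec else 0.0

def evaluate(models, corpus):
    """Run every candidate over the reference corpus and average its scores."""
    results = {}
    for name, answer_fn in models.items():
        scores = [relevance(answer_fn(q), ref) for q, ref in corpus]
        results[name] = sum(scores) / len(scores)
    return results

# Hypothetical reference corpus and candidates, for illustration only.
corpus = [
    ("What QA tools do you use?", "pytest selenium and k6 for load testing"),
    ("Are you available?", "available from next quarter"),
]
models = {
    "model-a": lambda q: "pytest selenium k6 load testing available next quarter",
    "model-b": lambda q: "I am a language model",
}
scores = evaluate(models, corpus)
best = max(scores, key=scores.get)
print(best)
```

Swapping in real candidates only means replacing each lambda with a call into the local inference runtime; the measurement loop stays the same.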
Rather than a cloud API, I chose a local model running on CPU. Latency is higher – that is the price of a solution that stays cost-effective at low usage and remains fully under my control.