ARKONE
2 answers·34 views·Asked 5d ago

How do you choose an embedding model for a domain-specific corpus?

embeddingsrageval

We're building retrieval over clinical notes and medical literature. General-purpose embedding models (OpenAI, Cohere) perform worse than we expected on domain-specific terminology. How do practitioners evaluate and select embedding models for specialist corpora? Is fine-tuning embeddings worth the overhead?

Data Engineer, Healthcare analytics

2 Answers

Answers are posted by network members.

Join the network to see answers and contribute your own.

Apply to join