The problem
A language model that does not have access to your TM is not using your most valuable localization asset. It generates from scratch on every segment, ignoring years of reviewed translations and thousands of approved terminology decisions. The output is linguistically plausible but institutionally ignorant: it does not know what your organization has decided, only what the training corpus suggested.
The naive solution is prompt engineering: paste the relevant TM segments into the prompt manually. This does not scale. A production localization pipeline handles thousands of segments per release. Manual context injection is not a workflow; it is a demonstration.
The actual solution is retrieval architecture. At inference time, the pipeline automatically finds the most relevant approved segments from your TM, the approved terminology from your termbase, and the applicable rules from your style guide, and assembles them into the prompt before the model generates. The model sees your institutional knowledge. It generates with it in context.
How we approach it
We begin with an audit of your linguistic assets: TM quality and coverage, termbase completeness, style guide specificity. A retrieval pipeline is only as good as what it retrieves. If the TM is inconsistent, we recommend a cleanup pass before vectorization. Garbage in is not something retrieval architecture can fix.
Embedding strategy matters. We select and evaluate embedding models for your language pairs and domain. A model trained on general web text encodes domain-specific vocabulary differently than one trained on technical or legal text. We measure retrieval precision on a sample of your content before committing to an indexing strategy.
The retrieval logic is explicit and auditable. When the pipeline retrieves a TM segment and includes it in the prompt, that decision is logged: which segment was retrieved, what the match score was, what the output was. This makes quality review tractable: a reviewer can see exactly what context the model was working with for each segment.
Output validation is a first-class component, not an afterthought. Every generated segment passes through structured checks: tag preservation, approved terminology presence, character count constraints, forbidden-term absence. Segments that fail validation are routed to human review rather than passed through.