Senior Applied Scientist, Document Understanding
Thomson Reuters | Remote | Posted 7 April 2026
Job Description
About the Role
This role sits within the applied science function. You will design, build, and deploy document understanding systems that directly power Westlaw, Practical Law, and CoCounsel. The problems are real, the scale is large, and the expectation is shipped, reliable, measurable impact.
You will work across semantic chunking, document enrichment, knowledge graph construction, and synthetic data generation for complex legal, tax, and accounting content. Multiple product teams depend on what this function delivers.
About You
You hold a PhD or Master's in Computer Science, AI, NLP, or a related field, with 5 years of post-degree industry experience taking NLP and document understanding systems from development to production at scale. You have hands-on depth across model development, distillation, evaluation, and deployment. You publish, work independently, lead through influence in an applied research setting, and measure success by what ships and performs in production.
What You'll Do
Design and deploy semantic chunking models for lengthy, non-uniformly structured legal documents with adjustable granularity across use cases
Build document enrichment systems using legal and customer-defined taxonomies
Develop LLM-based knowledge graph construction pipelines that extract and link citations, entities, and legal concepts across diverse legal content
Build scalable synthetic data generation systems for model training, multi-hop query simulation, and hallucination-free answer generation
Apply knowledge distillation techniques to compress large models into latency-constrained, production-ready SLMs
Design evaluation frameworks — component-level and end-to-end — using expert annotation and synthetic data
Drive technical decisions on architecture, chunking strategy, classification approach, and knowledge extraction methods
Partner with engineering on delivery, reliability, and scale across multiple product lines
Contribute to published research at venues such as ACL, EMNLP, ICLR, NeurIPS, SIGIR, and KDD, and to intellectual property
Required Qualifications
PhD or Master's in Computer Science, AI, NLP, or a related field
5 years of post-degree industry experience shipping document understanding, information extraction, or knowledge graph systems into production — not research-only experience
Publications at ACL, EMNLP, ICLR, NeurIPS, SIGIR, KDD, or equivalent
Experience leading through influence in an applied research setting
Production Python and experience with PyTorch, Hugging Face Transformers, and DeepSpeed
Hands-on production depth required in:
Document layout analysis and semantic chunking beyond fixed-size or paragraph-based methods
Hierarchical, multi-label document classification with domain-specific and customer-defined schemas
Entity recognition and linking, relation extraction, citation parsing, and knowledge graph construction from unstructured text
LLM-based information extraction, few-shot and multi-task learning, and post-training
Knowledge distillation, model compression, and SLM deployment under latency constraints
Synthetic data generation and annotation workflow design
End-to-end evaluation framework design for document understanding
Preferred Qualifications
Legal document understanding, legal IE, or legal AI experience
Complex document structures: nested hierarchies, cross-references, non-uniform formatting
Retrieval or QA systems over large document collections
RAG and agentic workflows in enterprise settings
Knowledge graph frameworks for legal or enterprise applications
AzureML or AWS SageMaker
What’s in it For You?
Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, ...