ML Research Scientist I/II, Multimodal Data Extraction
As an ML Research Scientist specializing in Multimodal Data Extraction at Lila Sciences, you will play a pivotal role in advancing the company's mission to develop scientific superintelligence. This position involves creating foundation models capable of autonomously reading, interpreting, and structuring scientific knowledge across various formats, including text, images, and experimental data. Your contributions will be instrumental in transforming complex scientific information into machine-understandable forms, thereby enhancing reasoning, prediction, and autonomous discovery in fields such as materials science and chemistry.
In this role, you will research and develop AI systems that extract and structure knowledge from diverse scientific sources. You will design and fine-tune large language models, multimodal models, and specialized models to ensure factual and interpretable data extraction. A key part of your day-to-day work will be building scalable pipelines for unstructured and heterogeneous scientific data that integrate text, tables, and visuals. You will also collaborate with domain experts to align extracted data with real-world discovery workflows, and publish research that advances the state of the art in multimodal understanding and AI-driven knowledge extraction.
The ideal candidate will possess a PhD (or equivalent research experience) in Computer Science, Chemistry, Materials Science, or a related field. Expertise in machine learning, natural language processing (NLP), and vision–language modeling using tools such as PyTorch and Hugging Face Transformers is required, as is a proven ability to train, fine-tune, and evaluate large language models (LLMs) and multimodal models for scientific data extraction. A strong understanding of data structures and representations used in the physical sciences is highly desirable, along with demonstrated research impact through publications in reputable conferences or journals, preprints, or open-source work.
Lila Sciences offers a competitive compensation package, with an expected base salary ranging from $176,000 to $304,000 USD per year, along with bonus potential and generous early equity. The final offer will reflect your unique background, expertise, and impact. Full-time U.S. employees receive a comprehensive benefits program, including medical, dental, and vision coverage; employer-paid life and disability insurance; flexible time off with generous company-wide holidays; paid parental leave; an educational assistance program; commuter benefits, including bike share memberships for office-based employees; and a company-subsidized lunch program.
Joining Lila Sciences means becoming part of a pioneering team dedicated to building the world's first scientific superintelligence platform and autonomous lab for life, chemistry, and materials science. The company is committed to applying AI to every aspect of the scientific method, aiming to solve humankind's greatest challenges in human health, climate, and sustainability. Guided by core values of truth, trust, curiosity, grit, and velocity, Lila Sciences offers an environment that moves with startup speed while tackling problems of historic importance. If you are passionate about advancing AI in the physical sciences and eager to contribute to groundbreaking discoveries, this is an opportunity to make a significant impact.