Senior Software Engineer, Data Platform

🇺🇸 Emeryville, California
$2K - $2K Annual
Posted 1 week ago
Expires July 6, 2026
Full TimeOn-siteEngineeringData Science

Profluent is seeking a Senior Software Engineer to design, build, and scale its data platform, which houses data from protein engineering campaigns, including protein designs, experimental results, partner datasets, analytical outputs, and model-ready training data. This platform enables rapid machine learning, biological discovery, and secure collaboration across internal and external programs. The role involves working closely with machine learning engineers, bioinformatics experts, and program teams to ensure data is organized, governed, accessible, and protected.

Key responsibilities include designing, building, and maintaining scalable data infrastructure for protein engineering campaigns, developing secure data pipelines for internal and partner-generated data, and owning core components of Profluent’s data warehouse and data platform using Python, GCP, PostgreSQL, BigQuery, and related cloud-native technologies. The engineer will also build systems that transform raw experimental, computational, and partner data into structured, reliable, analysis-ready, and model-ready datasets, establish best practices for data modeling, metadata management, data quality checks, schema evolution, versioning, and documentation, and collaborate with various teams to translate data requirements into scalable technical systems.

The ideal candidate will have over 5 years of experience in software engineering, data engineering, or data platform development, with strong proficiency in Python and modern software development practices, including git, testing, code review, CI/CD, and production deployment. Experience in designing and operating production data pipelines, data warehouses, and data models at scale is essential. Hands-on experience with cloud platforms, preferably GCP, and technologies such as BigQuery, PostgreSQL, object storage, workflow orchestration, and containerized services is required. A strong understanding of data security, access control, data partitioning or siloing, audit logging, and managing sensitive or restricted datasets is also necessary. A BS, MS, or PhD in Computer Science, Engineering, Data Science, Bioinformatics, or a related technical field, or equivalent practical experience, is required.

Profluent offers a high-growth opportunity with meaningful impact on the future of protein design, a competitive compensation package with equity participation, a 401(k) with a strong employer match, comprehensive benefits including health/dental/vision insurance, a generous PTO policy, and professional development opportunities in a cutting-edge field at the intersection of AI and biology.

The company fosters a collaborative and innovative culture, encouraging cross-disciplinary teamwork and continuous learning. Employees have the opportunity to work on groundbreaking projects that aim to revolutionize biomedicine through AI-driven protein design. Profluent is committed to diversity and inclusion, promoting a workspace where all individuals are valued and supported.

More Jobs at Profluent