Machine Learning Engineer, AI Powered - AI Framework
at GitLabAlready Applied? Save to tracker
About the job
GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab.
An overview of this role
Are you passionate about building robust frameworks to evaluate and ensure the reliability of AI models? As a Machine Learning Engineer on GitLab’s AI framework team, you’ll play a critical role in shaping the future of AI-powered features at GitLab. This is an exciting opportunity to work on impactful projects that directly influence the quality of GitLab’s AI capabilities.
You’ll help merge cutting-edge evaluation tools, optimize dataset management, and scale our validation infrastructure. Working closely with other AI feature teams, you’ll ensure that every AI feature we deliver is robust, reliable, and meets the highest quality standards.
Some challenges in this role include designing scalable solutions for LLM evaluation, consolidating disparate validation tools, and contributing to GitLab’s innovative AI roadmap.
Some examples of our projects:
Consolidating Evaluation Tooling | The GitLab Handbook GitLab.org / AI Powered / ELI5
What You’ll Do
- Design and implement technical evaluators for LLM assessment.
- Contribute to evaluation infrastructure consolidation efforts.
- Build scalable evaluation pipelines and frameworks.
- Develop and manage datasets and evaluation metrics.
- Collaborate with feature teams to integrate validation solutions.
- Optimize performance across ML evaluation systems.
- Support improvements to GitLab’s AI-powered tools through validation.
- Ensure all solutions align with GitLab’s infrastructure and security protocols.
What You’ll Bring
- Proven experience designing and implementing LLM evaluation systems.
- Strong understanding of ML model architectures, including public vs. private implementations.
- Expertise in ML evaluation metrics and dataset management.
- Demonstrated ability to build production-grade ML infrastructure.
- Practical experience with Python-based ML frameworks and evaluation tools (e.g., Langsmith, ELI5).
- Excellent problem-solving skills with an engineering mindset.
- Ability to collaborate in an asynchronous, remote-first environment.
- Familiarity with open-source development and contribution is a plus.
About the team
The AIF team ensures that AI models across GitLab are reliable and well-validated. We focus on building robust evaluation frameworks, consolidating tools, and streamlining processes to scale validation efforts across GitLab’s AI infrastructure. Working on high-impact projects, the team partners with AI feature teams to deliver quality-focused solutions that enhance user trust and product performance.
How GitLab will support you
- Benefits to support your health, finances, and well-being
- All remote, asynchronous work environment
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and development budget
- Parental leave
- Home office support
Remote-Global
Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.
GitLab
Classification:
Details and stages
Reporting to: details unknown
the hiring process information will appear here if available.
Job ref blurredText
Posted on blurredText
Last checked on blurredText
Closing on blurredText
Total attempts: 22 Unique: 10 Passed: 6
Discuss this job
Share your intel on this vacancy and help others - anonymously
pretend that this is a blurredText long comment