Machine Learning Engineer, AI Powered - AI Framework

at GitLab  
Software
About the job
REMOTEWFH 5 days a weekSan Francisco, California, United StatesFull-Time ~ Permanent
Open to new applications

1 job requirement

Preview the competition
5 years Python experience, used daily Must Have  

GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab.

An overview of this role

Are you passionate about building robust frameworks to evaluate and ensure the reliability of AI models? As a Machine Learning Engineer on GitLab’s AI framework team, you’ll play a critical role in shaping the future of AI-powered features at GitLab. This is an exciting opportunity to work on impactful projects that directly influence the quality of GitLab’s AI capabilities.

You’ll help merge cutting-edge evaluation tools, optimize dataset management, and scale our validation infrastructure. Working closely with other AI feature teams, you’ll ensure that every AI feature we deliver is robust, reliable, and meets the highest quality standards.

Some challenges in this role include designing scalable solutions for LLM evaluation, consolidating disparate validation tools, and contributing to GitLab’s innovative AI roadmap.

Some examples of our projects:

Consolidating Evaluation Tooling | The GitLab Handbook GitLab.org / AI Powered / ELI5

What You’ll Do

  • Design and implement technical evaluators for LLM assessment.
  • Contribute to evaluation infrastructure consolidation efforts.
  • Build scalable evaluation pipelines and frameworks.
  • Develop and manage datasets and evaluation metrics.
  • Collaborate with feature teams to integrate validation solutions.
  • Optimize performance across ML evaluation systems.
  • Support improvements to GitLab’s AI-powered tools through validation.
  • Ensure all solutions align with GitLab’s infrastructure and security protocols.

What You’ll Bring

  • Proven experience designing and implementing LLM evaluation systems.
  • Strong understanding of ML model architectures, including public vs. private implementations.
  • Expertise in ML evaluation metrics and dataset management.
  • Demonstrated ability to build production-grade ML infrastructure.
  • Practical experience with Python-based ML frameworks and evaluation tools (e.g., Langsmith, ELI5).
  • Excellent problem-solving skills with an engineering mindset.
  • Ability to collaborate in an asynchronous, remote-first environment.
  • Familiarity with open-source development and contribution is a plus.

About the team

The AIF team ensures that AI models across GitLab are reliable and well-validated. We focus on building robust evaluation frameworks, consolidating tools, and streamlining processes to scale validation efforts across GitLab’s AI infrastructure. Working on high-impact projects, the team partners with AI feature teams to deliver quality-focused solutions that enhance user trust and product performance.

How GitLab will support you

Remote-Global

Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote, however some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.

GI

GitLab

Classification:

blurTagText
blurTagText

Enable 1-click access to other sources:
                  and more
Do your research faster with Quick Links
Details and stages

Reporting to: details unknown

the hiring process information will appear here if available.

Job ref blurredText

Posted on blurredText

Last checked on blurredText

Closing on blurredText

1 discussion comments
0 requirements
4 Saved as Applied
Qualify To Apply check results
Total attempts: 22  Unique: 10  Passed: 6
Understand who you are up against with Competitive Insights
Discuss this job
Share your intel on this vacancy and help others - anonymously
Team inkscroll - 0 days ago

pretend that this is a blurredText long comment