DevOps Engineer - AI Model Evaluator

Obsidian

helsinki, uusimaa, Finland Full-time July 01, 2026

Opportunity Description

About the Role Mercor is partnering with a leading AI research lab to support a Frontier Code Agents project. 
Contributors help evaluate and improve frontier AI coding models through structured technical assessments. 
The work focuses on realistic infrastructure engineering workflows and model evaluation. 
Spots are limited and filling quickly on a first come, first serve basis. 
What You'll Do Use frontier AI coding agents to complete and evaluate complex infrastructure engineering tasks. 
Review model-generated implementations involving cloud platforms, Kubernetes, CI/CD systems, observability, and infrastructure automation. 
Identify bugs, edge cases, reliability issues, and failure modes. 
Compare outputs from multiple frontier models and assess their strengths and weaknesses. 
Apply professional engineering judgment to realistic infrastructure engineering scenarios. 
<...
        

Full-time Software Development, Software Architecture & Engineering

Interested in this opportunity? Apply now through Expertini.

Apply for this Position

Location helsinki, uusimaa

Country Finland

Type Full-time

Category Software Development, Software Architecture & Engineering

Posted July 01, 2026

Deadline August 10, 2026

DevOps Engineer - AI Model Evaluator

Opportunity Description

About the Role

What You'll Do

Opportunity Details

About Obsidian

Obsidian