Remote Senior Software Engineer (LLM) – 35501 – Turing

Company: Turing
Palo Alto, California | Contract |

Company Snapshot

Turing is an AI company focused on advancing AI-assisted software development. The company works to empower the next generation of AI systems to reason about and work with real-world software repositories. This project involves developing high-quality training and evaluation datasets to improve how Large Language Models (LLMs) perform on real software engineering problems. The core task includes identifying and curating verifiable coding tasks from public GitHub repositories, supported by a human review process.

Role Overview / Mission

This contract role involves reviewing AI-generated code that solves real software tasks. Your feedback will be used to improve how future AI models learn to write and understand code.

Key Responsibilities

* Review and compare 3–4 model-generated code responses for each task using a structured ranking framework.
* Assess code changes (diffs) for correctness, quality, readability, and performance.
* Provide clear, concise explanations for ranking decisions.
* Maintain consistency and fairness across all evaluations.
* Identify and document edge cases or unusual model behavior.
* Collaborate with the team to improve evaluation processes and identify improvement areas.

Required Qualifications / Skills

**Professional Experience**
* Several years of experience as a software engineer. Experience working as a data scientist will not be considered.
* Minimum 2 years of experience as a full-stack engineer at a leading tech product company (e.g., Google, Shopify, Microsoft, Snowflake, Meta, PayPal).
* Only full-time employment experience qualifies; contractual or part-time roles will not be considered.

**Technical Skills**
* Strong understanding of software design, debugging, and engineering best practices.
* Familiarity with code review processes and version control systems.
* Ability to analyze and compare real-world code changes.
* Excellent written communication skills for clearly explaining technical evaluations.

Preferred / Nice-to-Have Skills

* Degree from a top-ranked university.
* Experience working with LLM-generated code or AI evaluation projects.
* Background in developer tools or systems automation.
* Exposure to AI research or developer agents.

Location & Work Setup

This is a contract position with a commitment of 10–20 hours per week. Work hours are flexible but require partial overlap with Pacific Time. The initial duration is 1 month, with potential for extension.

Compensation & Benefits

Compensation ranges from $50–$150 per hour, based on experience, geography, and skill level. This contract role does not include medical benefits or paid leave.

Timezone: Asia/Kolkata
Posted: Sep 08, 2025
Expires: Oct 08, 2025