All roles

Generative AI Evaluator (Russian) | $15/hr Remote

Remote · USA Full-time New today

Position: AI Quality Analyst (Russian) Type: Hourly contract Compensation: $15/hour Location: Remote Commitment: 30–40 hours/week Role Responsibilities

  • Evaluate AI model responses for personalization quality, including grounding, integration, and helpfulness.
  • Design and execute multi-turn prompts based on personal context to test AI capabilities.
  • Analyze responses for hallucinations, incorrect personalization, and poor inferences.
  • Perform side-by-side comparison of model outputs to determine quality and effectiveness.
  • Write clear and structured rationales for response evaluations and rankings.
  • Extract and verify debug information to ensure proper use of data sources.
  • Maintain strict data hygiene and ensure accurate documentation of evaluations.
  • Collaborate with cross-functional teams to improve AI model performance.

Requirements

  • Strong proficiency in Russian with excellent reading and writing skills.
  • Experience in data annotation, AI evaluation, content moderation, or a related role.
  • Strong analytical thinking and ability to assess nuanced AI responses.
  • Ability to design creative, multi-turn prompts based on personal context.
  • Understanding of personalization concepts, including identifying incorrect or forced personalization.
  • High attention to detail in evaluating subtle differences in model outputs.
  • Excellent written communication and structured reasoning skills.
  • Ability to work independently in a remote environment.
  • Willingness to use a personal Google account for evaluation purposes.
  • Full-time availability with at least 4 hours overlap with PST.
  • Bachelor’s degree or equivalent experience in a relevant analytical field.

Application Process

  • Apply/Easy Apply and check email for application form
  • Fill Google form
  • Assessment Link (After shortlisting – to be completed within 24 hours)
  • Language vetting

Apply tot his job Apply To this Job

Related roles

Product Manager - Healthcare (Remote)

Remote · USA Full-time

Product Owner (Specialty Lines Insurance)

Remote · USA Full-time

Product Owner – Digital Enablement

Remote · USA Full-time

Product Owner (Data Center) || W.2 only, No C.2.C & No H.1s, E.A. Ds

Remote · USA Full-time

AI Product Owner- Quote & Order Management

Remote · USA Full-time

Senior Manager, Data & AI Product Owner – Clinical Development - Foster City

Remote · USA Full-time

Senior Product Owner - AI

Remote · USA Full-time

Perioperative Software Product Owner - prefer RN/NP w OR or ASC skills

Remote · USA Full-time

Salesforce Product Owner, Administrator

Remote · USA Full-time

Product Owner

Remote · USA Full-time

Registered Nurse , CDI (Clinical Documentation), Harborview Medical Center

Remote · USA Full-time

PM with Debt and Collections Experience

Remote · USA Full-time

Experienced Work From Home Customer Service Representative – Part-Time Opportunity at arenaflex

Remote · USA Full-time

Maltese Interpreter

Remote · USA Full-time

Experienced Junior Data Entry Specialist – Remote Opportunity at arenaflex

Remote · USA Full-time

Experienced Call Center Representative/Customer Service Professional – Data-Driven Solutions Expert

Remote · USA Full-time

Experienced Part-Time Online Customer Service Representative – arenaflex Chat Support Team

Remote · USA Full-time

Customer Service Representative - Luxury Retail – Remote USA at arenaflex

Remote · USA Full-time

Experienced Customer Service Manager – Paso Robles, CA

Remote · USA Full-time

Specialist I, Prior Authorization-Lumicera

Remote · USA Full-time