Freelance AI Evaluation & Validation Specialists

Job posted at: Jun 8, 2026 08:37 GMT
Job approved and potential candidates notified at: Jun 8, 2026 11:25 GMT

Job type: Translation/editing/proofing job
Service required: Translation


Languages: English to Indonesian, English to Korean, English to Thai, English to Vietnamese, Indonesian to English, Korean to English, Thai to English, Vietnamese to English

Job description:
Engagement type: Independent Contractor (IC) / Supply-as-a-Service / Freelancers
Location: Remote
Project description:
Uber AI Solutions is seeking high-level Bilingual AI Evaluation & Validation Specialists to support one of the largest AI frontier labs in the world. This project focuses on the rigorous testing and quality vetting of generative AI models to ensure linguistic accuracy and safety across global markets.

Supported languages:
We are sourcing a small, elite bench (2–5 specialists per language) for the following pairs:
English (Native-level)
Vietnamese
Korean
Indonesian
Thai

Scope of work:
As a Quality Vetting Specialist, your responsibilities include:
Prompt Quality Review: Evaluating the complexity, relevance, and safety of prompts used to train frontier AI models.
Linguistic Validation: Reviewing model outputs for high-level language accuracy, grammatical precision, and cultural nuance.
Model Testing: Conducting iterative testing to identify edge cases or linguistic failures in the target language.
Collaborative Alignment: Joining live synchronization calls in CST (Central Standard Time) to coordinate with Program Managers on evolving quality standards.

Skills and qualifications:
Academic Background: Master’s degree preferred (Linguistics, Computer Science, or related fields).
AI Expertise: Prior experience in AI evaluation, RLHF (Reinforcement Learning from Human Feedback), or model testing is highly desirable.
Bilingual Mastery: Complete fluency in both English and the target language (written and verbal).
Technical Rigor: Ability to provide highly analytical feedback on complex linguistic data under tight deadlines.
Availability: Willingness to commit to 20–35 hours per week for a multi-week engagement.

Budget and payment details:
Budget information for this job is restricted to those who meet the requirements of the job.
Service provider targeting (specified by job poster):
Membership: Non-members may quote after 12 hours
Subject field: Linguistics
Quoting deadline: Aug 30, 2026 23:13 GMT
Delivery deadline: Aug 31, 2026 23:13 GMT
About the outsourcer:
This job was posted by a Blue Board outsourcer with a "likelihood of working again" average rating of 1 out of 5
