English
Polish
Turing Is Hiring An Ai Quality Analyst
Summary: Join Turing as an AI Quality Analyst to evaluate personalized AI features, leveraging your analytical and creative skills in a dynamic, remote role focused on AI research and quality assurance. 🚀
About Turing
Turing is a leading research accelerator in the AI domain, supporting AI labs and global enterprises. We specialize in enhancing research with high-quality data and expertise, helping businesses transition AI from concept to functioning systems that yield measurable results.
Role Overview
As an AI Quality Analyst, your main responsibility will be to evaluate a new personalization feature for Gemini. You will analyze the model’s performance in delivering relevant and helpful responses across various Google platforms. This role blends creative thinking and analytical skills, enabling you to design prompts inspired by personal experiences and critically assess the AI's responses.
Key Qualifications
- 📝 Fluency in Polish, both written and spoken, essential for this project.
- 🔐 Utilization of your personal Google account (not a testing account) for authentic evaluations.
- ⏰ Full-time availability, with flexibility for a global operation.
- 🧠 Strong analytical skills to interpret nuanced AI responses and assess personalization quality.
- 🎨 Experience in crafting creative prompts that incorporate personal context.
- 🔍 Familiarity with personalization concepts and identifying response inaccuracies.
- 🧐 Meticulous attention to detail, especially in spotting subtle response differences.
- ✍️ Ability to write clear and concise rationales referencing specific turns in conversations.
- 💡 Capable of providing constructive feedback with detailed annotations.
- 🤝 Excellent communication skills and collaborative spirit.
- 🧑💻 Self-motivated and able to work independently in a remote environment.
- 💻 Reliable access to a desktop/laptop and quality internet connection.
Job Responsibilities
- 🤖 Participate in a team assessing AI interactions and responses.
- 📝 Create and execute multi-turn conversation prompts involving personal info and experiences.
- ✅ Evaluate responses for appropriateness and clarity based on prompts.
- 🔎 Analyze responses for grounding issues, ensuring claims are substantiated.
- 🌐 Review integration of personal data in responses, ensuring natural delivery.
- 📊 Compare and rank two model responses to determine overall quality.
- 📝 Document clear, objective comparisons highlighting strengths and weaknesses.
- 🔍 Verify model performance by checking relevant data sources.
- 🧹 Maintain data hygiene by deleting evaluation conversations to preserve chat history.
Education & Experience
- 🎓 Bachelor's degree or equivalent experience in Policy, Law, Computer Science, or Linguistics.
- ⭐ Preferred experience in data annotation, AI evaluation, or content moderation roles.
Offer Details
- ⌚ Expected commitment: 4 to 40 hours per week, including at least 4 hours overlapping with PST.
- 📝 Contractor engagement for 3 months.
- 💰 Compensation: $15 per hour.
Evaluation Process
- 📄 Selected candidates will receive a Job Interest Form.
- 📝 Post-review, an assessment will be provided, to be completed within 24 hours.
- 🤝 Successful candidates will discuss pre-onboarding requirements.
After applying, expect a confirmation email with a link to complete your profile.