AI Tutor - Vietnamese
As an AI Tutor specializing in Vietnamese at xAI, you will play a pivotal role in enhancing our AI system, Grok, by training it to excel in voice interactions, speech recognition, and auditory experiences across diverse languages and cultural contexts. This position is integral to our mission of creating AI systems that accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence, operating with a flat organizational structure where all members are hands-on and contribute directly to the company's objectives.
In this role, your primary responsibilities will include using proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. You will support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details such as intonation, rhythm, and accent, and adherence to professional audio standards. Additionally, you will collaborate with technical staff to develop tasks that improve the AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing, as well as work with them to improve annotation tools for efficient audio workflows.
The ideal candidate will possess native proficiency in Vietnamese, with exposure to diverse accents, dialects, or regional variations, and proficiency in English (minimum B2 level) with clear, natural vocal delivery and pronunciation suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages is essential. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form, as well as transcribe audio with high accuracy across accents and varying audio quality, is required. Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages, along with strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech, are also necessary. Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively, are crucial. A commitment to developing AI that masters sophisticated multilingual audio capabilities is expected.
Compensation for this role ranges from $35 to $45 per hour, depending on factors including relevant experience, skills, education, geographic location, and qualifications. Benefits vary based on employment type, location, and jurisdiction. For eligible U.S.-based positions, benefits include health insurance, a 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided during the interview process.
At xAI, we foster a culture that values curiosity, initiative, and direct contribution to our mission. Leadership is given to those who show initiative and consistently deliver excellence. We operate with a flat organizational structure, encouraging all employees to be hands-on and to contribute directly to the company's mission. This environment provides ample growth opportunities for individuals who appreciate challenging themselves and thrive on curiosity.