AI Tutor - Marathi
As an AI Tutor specializing in multilingual audio capabilities, you will play a pivotal role in enhancing xAI's Grok system to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Your contributions will directly impact Grok's global accessibility, enabling natural spoken interactions for users worldwide and bridging language barriers through accurate speech processing.
In this role, you will curate and annotate high-quality audio data, ensuring clear, natural spoken output and accurate representation of linguistic and prosodic details such as intonation, rhythm, and accent. Collaborating closely with technical staff, you will develop tasks that improve the AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. Additionally, you will work on enhancing annotation tools to streamline audio workflows.
The ideal candidate possesses native proficiency in Marathi, with exposure to diverse accents, dialects, or regional variations, and a minimum B2 level proficiency in English, with clear, natural vocal delivery suitable for audio recording purposes. Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages is essential. Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form, is required. Comfort in providing high-quality voice recordings and feedback on audio samples in multiple languages is also necessary.
Preferred qualifications include exceptional attention to linguistic nuance, auditory detail, and data quality beyond standard transcription work. A background in linguistics, speech sciences, cognitive science, or a related field, with demonstrated ability to analyze accent variation, pronunciation differences, and multilingual speech patterns, is advantageous. Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge of training voice models and understanding how data quality impacts model performance, is highly desirable. Professional experience in voice work, such as voice acting, voice recording, or podcasting, demonstrating attention to clarity and recording quality, is also beneficial.
Compensation for this role ranges from $35 to $45 per hour, depending on factors including relevant experience, skills, education, geographic location, and qualifications. Benefits vary based on employment type, location, and jurisdiction, with eligible U.S.-based positions including health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided during the interview process.
xAI fosters a collaborative and innovative work environment, emphasizing engineering excellence and a flat organizational structure. Employees are expected to be hands-on, contributing directly to the company's mission, with leadership opportunities given to those who show initiative and consistently deliver excellence. Strong communication skills are valued, enabling concise and accurate knowledge sharing among teammates.