AI Engineer - Classifiers, Media Intelligence & Voice R&D

Remote

Full time

Senior

Remote, United States

PythonAI EngineeringMachine Learning EngineeringAI EngineerMachine LearningR&DaudioClassificationContent Moderationimage classificationintelligencemediaPyTorchTensorFlowtext

Job description

This is a remote position.

Our client is looking for an innovative and driven AI Engineer to join their team. A leader in media intelligence and AI-driven content creation, they have recently expanded their work in AI voice and image technologies, driving the development of the next generation of cutting-edge products. This role will focus on the creation, classification, and organization of massive volumes of AI-generated media, along with spearheading R&D into AI voice and audio generation and advanced image intelligence capabilities.

Job Description
Responsibilities:

Design, train, and deploy classification models for content pipeline, including style detection, quality scoring, content moderation, filtering, and semantic categorization of generated media.
Develop and maintain automated tagging and organization systems for the media library: extracting attributes, detecting visual features, clustering similar content, and enabling intelligent search.
Build and optimize training data pipelines: create annotation tooling, curate datasets, establish active learning loops, and ensure high-quality labeled data.
Lead R&D into AI voice and audio generation, including voice cloning, text-to-speech, and audio synthesis; prototype integrations and create a production-ready pathway from research to features.
Research and prototype image intelligence technologies such as face/body analysis, pose estimation, style transfer, and image-to-image consistency.
Develop evaluation frameworks to measure the accuracy of classifiers, the quality of generation models, and model drift over time.
Optimize inference pipelines for performance, cost, and latency—incorporating batching, quantization, caching, and model serving strategies.
Integrate with GPU compute infrastructure and deliver models via production APIs.

Requirements
3+ years of experience building and deploying machine learning models in production, particularly in classification, tagging, or content understanding.
Hands-on experience with model training, including dataset curation, experimenting with architectures, tuning hyperparameters, and debugging.
Strong background in image classification and computer vision techniques (e.g., CNNs, vision transformers, CLIP).
Experience or demonstrated interest in voice/audio AI (e.g., text-to-speech, voice cloning, audio classification).
Proficiency in Python, with experience in PyTorch or TensorFlow.
Experience with building data labeling pipelines, annotation workflows, or active learning systems.
Understanding of model serving in production environments, including REST APIs and latency optimization.
Qualifications:
Bachelor’s degree or higher in Computer Science, Engineering, or related field.
Experience in AI/ML, particularly in content classification, tagging, and media organization systems.
Proven experience with Python and ML frameworks like PyTorch or TensorFlow.
Strong communication skills to collaborate with R&D teams and integrate new technologies into production.

Benefits
100% remote, full-time role.
Flexible work hours.
Competitive salary and comprehensive benefits package.
Opportunity for career advancement and personal development.
Work on high-impact projects with cutting-edge AI technologies.

Good match

We match every vacancy against your profile and show a fit score — so you instantly know which ones are worth applying to. Sign up and create a resume — it's free.

Not enough data to estimate a salary range for this role in this region yet.

AI Engineer - Classifiers, Media Intelligence & Voice R&D

Job description

This is a remote position.

Job DescriptionResponsibilities:

Requirements

Qualifications:

Benefits

Job Description
Responsibilities: