Senior MLOps Engineer Job at DeepRec.ai, San Jose, CA

YjF0cHhQa0Q4VS9WMXhGemh0SStiek1LVEE9PQ==
  • DeepRec.ai
  • San Jose, CA

Job Description

Senior MLOps Engineer

We are hiring for an MLOps Engineer for a fast-moving AI startup who are building a worldclass AI-powered video platform.

We are looking for a skilled and hands-on MLOps Engineer to join their growing team. You will play a critical role in deploying, scaling, and maintaining their machine learning infrastructure, supporting a range of tools that enable the controlled generation of high-quality animated videos.

Key Responsibilities

  • Design, deploy, and maintain scalable training and data-processing pipelines on distributed compute clusters (e.g., Slurm, Kubernetes, or cloud-native equivalents).
  • Optimize inference systems for latency and cost in a production setting.
  • Collaborate closely with ML researchers and engineers to productionize deep learning models.
  • Implement robust monitoring, logging, and alerting systems for model performance and infrastructure reliability.
  • Automate model testing, validation, and deployment processes across staging and production environments.
  • Ensure efficient usage of compute resources, including GPU clusters, and help identify bottlenecks or cost-saving opportunities.

Requirements

  • Proven experience in MLOps, ML infrastructure, or related roles.
  • Deep expertise in deploying and maintaining ML training pipelines on distributed systems.
  • Strong knowledge of inference optimization techniques, especially in reducing latency and cost at scale.
  • Proficiency with cloud platforms (AWS, GCP, Azure) and orchestration tools (Kubernetes, Docker).
  • Experience working with GPU scheduling, distributed training (e.g., PyTorch DDP), and model serving frameworks (e.g., Triton, TorchServe).
  • Familiarity with CI/CD for ML workflows.
  • Strong Python skills and experience with ML/DL frameworks like PyTorch or TensorFlow.

Bonus Points

  • Experience working in the creative media or animation industry.
  • Exposure to video processing, generative AI, or large-scale content production systems.
  • Experience collaborating with research teams or integrating research code into production pipelines.

Please apply for more information

Job Tags

Similar Jobs

Roman Catholic Diocese of Orange

Hospitality and Events Assistant Job at Roman Catholic Diocese of Orange

POSITION TITLE: Hospitality and Events Assistant JOB CLASSIFICATION: Part Time Non-Exempt DEPARTMENT/PROGRAM: Campus Hospitality REPORTS TO: Director of Hospitality SCHEDULE: Approximately 15 hours per week, with evening and weekends PAY RANGE: $22...

ProCare Therapy

Account Executive Job at ProCare Therapy

 ...relationships with clients and candidates, and work within a supportive, high-energy team that...  ..., and much much more Work-from-home flexibility, which you can earn based on your...  ...commissions and room for growth I'd love to chat with you about the possibilities at... 

KellyMitchell Group

Brand Bilingual Copywriter Job at KellyMitchell Group

Job Summary: Our client is seeking a Brand Bilingual Copywriter to join their team! This position is located in McLean, Virginia. Duties: Review, edit and approve changes to raw translated content in a way that ensures brand personality and voice shines through...

VOTEGTR

Graphic Designer Job at VOTEGTR

 ...Experience working with political campaigns or conservative causes is a plus. Position Details: ~ Part-time, flexible hours. ~ Remote work available. ~ Compensation based on experience. ~1099 contract How to Apply: ~ Please send your resume, portfolio,... 

MLR.org

Certified Registered Nurse Anesthetist | CRNA Job at MLR.org

Certified Registered Nurse AnesthetistUp to $30,000 Sign-On Bonus Available / Up to $10,000 Relocation Assistance Available / Full...  ...for a skilled and compassionate Certified Registered Nurse Anesthetist (CRNA) to join a dynamic healthcare team in beautiful New Mexico. As...