Senior MLOps Engineer Job at DeepRec.ai, San Jose, CA

YjF0cHhQa0Q4VS9WMXhGemh0SStiek1LVEE9PQ==
  • DeepRec.ai
  • San Jose, CA

Job Description

Senior MLOps Engineer

We are hiring for an MLOps Engineer for a fast-moving AI startup who are building a worldclass AI-powered video platform.

We are looking for a skilled and hands-on MLOps Engineer to join their growing team. You will play a critical role in deploying, scaling, and maintaining their machine learning infrastructure, supporting a range of tools that enable the controlled generation of high-quality animated videos.

Key Responsibilities

  • Design, deploy, and maintain scalable training and data-processing pipelines on distributed compute clusters (e.g., Slurm, Kubernetes, or cloud-native equivalents).
  • Optimize inference systems for latency and cost in a production setting.
  • Collaborate closely with ML researchers and engineers to productionize deep learning models.
  • Implement robust monitoring, logging, and alerting systems for model performance and infrastructure reliability.
  • Automate model testing, validation, and deployment processes across staging and production environments.
  • Ensure efficient usage of compute resources, including GPU clusters, and help identify bottlenecks or cost-saving opportunities.

Requirements

  • Proven experience in MLOps, ML infrastructure, or related roles.
  • Deep expertise in deploying and maintaining ML training pipelines on distributed systems.
  • Strong knowledge of inference optimization techniques, especially in reducing latency and cost at scale.
  • Proficiency with cloud platforms (AWS, GCP, Azure) and orchestration tools (Kubernetes, Docker).
  • Experience working with GPU scheduling, distributed training (e.g., PyTorch DDP), and model serving frameworks (e.g., Triton, TorchServe).
  • Familiarity with CI/CD for ML workflows.
  • Strong Python skills and experience with ML/DL frameworks like PyTorch or TensorFlow.

Bonus Points

  • Experience working in the creative media or animation industry.
  • Exposure to video processing, generative AI, or large-scale content production systems.
  • Experience collaborating with research teams or integrating research code into production pipelines.

Please apply for more information

Job Tags

Similar Jobs

Rheem Manufacturing

Delivery Driver (Non-CDL) - Fairfield, NJ Job at Rheem Manufacturing

 ...HVAC solutions to contractors, multifamily industries, and commercial properties across the Northeast. Were looking for a reliable Delivery Driver (Non-CDL) to be an essential part of our team, delivering excellent service and embodying our company values. The ideal... 

R2 Global

Hybrid - Amazon Advertising Manager - $95,000 + Benefits - Miami, FL Job at R2 Global

 ...partnering with a fast-growing, eCommerce-driven fashion brand headquartered in sunny Miamiknown for their bold designs, cutting-edge marketing campaigns, and outstanding company culture. After five years of explosive growth, theyre expanding their team and looking for an... 

Riverside Medical Clinic

RN - SURGERY CENTER- Full Time Job at Riverside Medical Clinic

 ...facilitating the admission process for pre-scheduled patients to the Surgery Center. Reviews patient information for admissions and initiates...  ...acute care hospitals, behavioral health facilities, outpatient facilities and ambulatory care access points, an insurance offering... 

UNOX Inc

Assembler Job at UNOX Inc

The Assembler is responsible for the mechanical and electrical assembly/repair of any UNOX products as spareparts kit, ovens, hoods , stands or any type of pre-built configuration as well as the final testing. The Assembler needs to perform his job in agreement within...

Weed Man - Triad, NC

Fleet Manager- Mechanic Job at Weed Man - Triad, NC

 ...We are looking for a skilled Mechanic to maintain and repair machinery and maintain our vehicle fleet. You will be responsible for ensuring functionality and reliability of machines, engines and mechanical systems. An excellent mechanic must have manual dexterity and great...