Senior AL/ML Ops Engineer-II (Hybrid in Bangalore)

Smartsheet
Bangalore, INDIAPosted 6 March 2026

Job Description

<div class="content-intro"><p>For over 20 years, Smartsheet has helped people and teams achieve–well, anything. From seamless work management to smart, scalable solutions, we’ve always worked with flow. We’re building tools that empower teams to automate the manual, uncover insights, and scale smarter. But more than that, we’re creating space– space to think big, take action, and unlock the kind of work that truly matters. Because when challenge meets purpose, and passion turns into progress, that’s magic at work, and it’s what we show up for everyday.</p></div><p>Our India Global Capability Center isn't just supporting global operations—we’re leading global innovation. After scaling rapidly into a best-in-class hub, we deliver the product innovation and enterprise capabilities that accelerate our global growth, profitability, and scale. As we expand Smartsheet India, we’re searching for <strong>Senior AI/ML Ops Engineers</strong> who crave variety and ownership. You’ll have the opportunity to work across multiple teams and disciplines, building a versatile skillset while solving the complex challenges of a global platform.</p> <p><strong>You Will:</strong></p> <ul> <li>Designing, Developing and overseeing the strategy and architecture of scalable and reliable AI/ML Ops platforms / pipelines</li> <li>Model Deployment: Package and deploy AI/ML services to production, ensuring they are reproducible and interpretable</li> <li>CI/CD Pipeline Development: Design and implement automated CI/CD (Continuous Integration/Continuous Deployment) pipelines to accelerate model deployment using tools</li> <li>Infrastructure Management: Provision and optimize infrastructure for training and serving, utilizing Docker, Kubernetes, or serverless platforms</li> <li>Monitoring Observability : Implement post-deployment monitoring for model performance, data drift, and latency using tools. Experience in Monte Carlo is preferable</li> <li>Automation: Automate retraining and data pipeline workflows to ensure models stay accurate over time.</li> <li>Manage the deployment of foundation models, fine-tuning workflows, and Retrieval-Augmented Generation (RAG) stacks (Vector DBs, Knowledge Graph. Experience with AWS Bedrock is preferable</li> <li>Resource Optimization: Manage GPU/CPU utilization to minimize cloud costs while maintaining low-latency inference for users</li> <li>Collaboration: Work closely with data scientists, data engineers, and software engineers to bridge the gap between model development and production.</li> <li>Version Control Governance: Manage versioning for data, code, and models using tools like MLflow.</li> <li>Security Compliance: Implementing data security measures, ensuring compliance with data governance policies, and protecting sensitive data</li> <li>Technology Evaluation and Innovation: Staying abreast of emerging data technologies and exploring opportunities for innovation to improve the organisation’s data infrastructure</li> <li>Troubleshooting and Problem Solving: Diagnosing and resolving complex data-related issues, ensuring the stability and reliability of the data platform</li> <li>Perform other duties as assigned</li> </ul> <p><strong>You Have:</strong></p> <ul> <li>Enterprise SaaS software solutions with high availability and scalability</li> <li>Solution handling large scale structured and unstructured data from varied data sources</li> <li>Experience in building and maintaining AI/ML Ops platform systems ensuring scalability, reliability, efficiency and security</li> <li>Working with Product engineering team to influence designs with data, AI and analytics use cases in mind</li> <li>In depth experience in System design, AI/ML Frameworks and tools involving large Petabytes of data with Databricks Lakehouse ecosystem</li> <li>AI/MLOps workflows on Databricks , MLFlow, Mosaic AI Agent Framework, Unity Catalog, Vector Search, Knowledge Graph</li> <li>Knowledge of AI/ML frameworks like LangChain, LangGraph for AI/ML Ops ... (truncated, view full listing at source)