Recruiter Analysis
Senior Data Scientist | MLOps Engineer
The candidate is an experienced senior data scientist with a clear focus on production-grade machine learning and MLOps. Their background shows deep involvement in building and operating end-to-end pipelines across AWS, GCP and Azure, with practical experience in orchestration (Airflow, Argo, Kubeflow), containerization (Docker, Kubernetes/AKS), model versioning (MLflow, DVC), and monitoring (Prometheus, Grafana, ELK). They combine traditional analytics (A/B testing, forecasting, SQL-driven feature engineering) with modern GenAI and NLP work (BERT, OpenAI GPT-4, LangChain), and have shipped business-facing artifacts such as dashboards, automated reports and production chatbots. Domain exposure spans fintech, mortgage document processing, retail pricing, healthcare and CRM integrations, which supports adaptability to enterprise contexts. Strengths include pragmatic MLOps execution, stakeholder management, and the ability to operationalize models at scale. Areas to probe in interviews: depth on architecture choices for latency-sensitive serving, specifics around feature store or metadata management, and examples of trade-offs made in production (cost vs. latency vs. accuracy). Overall, well-suited for senior engineering roles that require both hands-on MLOps and cross-functional leadership.
Technical Profile
13+ years experience building data-driven solutions across Fintech, retail, mortgage and healthcare domains. Strong in Python, SQL, BigQuery, Snowflake, Databricks and end-to-end MLOps including Airflow, Kubeflow, Argo and Kubernetes. Experience spans NLP, time-series forecasting, deep learning, model monitoring and CI/CD for production ML.
The candidate demonstrates consistent ownership of enterprise-scale ML systems: data ingestion, feature engineering, in-database training, model deployment and monitoring. Projects combine classical analytics (forecasting, A/B testing dashboards) with modern GenAI and MLOps (RAG chatbots, automated retraining pipelines). Work shows multi-cloud deployments and experience operationalizing models with observability.
Python, SQL, BigQuery, Snowflake, Databricks, PySpark, TensorFlow/PyTorch, BERT, OpenAI GPT-4, LangChain, Airflow, Kubeflow, Argo, Docker, Kubernetes, MLflow, DVC, AWS, GCP, Azure, Tableau, Power BI, R Shiny.
Skills
Primary
Data ScienceMachine LearningMLOpsPythonSQLBig Data AnalyticsModel DeploymentFeature Engineering
Secondary
Natural Language Processing (NLP)Time Series ForecastingDeep LearningComputer VisionModel MonitoringCI/CDStakeholder EngagementAgile/ScrumTest Driven Development (TDD)Data Visualization
Frameworks
TensorFlowPyTorchKerasMLlibXGBoostBERTOpenCVCaffeFlaskLangChainR Shiny
Databases
BigQuerySnowflakeAWS RedshiftCloud SQLSQL ServerRDS
Cloud
AWSGCPAzure
Work Experience
Sr AI Data Scientist/MLops Engineer
Intuit
Owned and implemented end-to-end analytics and MLOps solutions for enterprise-scale products. Work spans cohort and GN S analyses, A/B experiment implementation and visualization, automated retraining pipelines, multi-cloud model deployment, generative AI initiatives, ETL and data pipeline optimization, forecasting and pricing models, and operational reporting across finance, marketplace, and mortgage domains.
PythonSQLBigQueryBigQuery MLSnowflakeDatabricksPySparkApache AirflowKubeflow PipelinesArgo WorkflowsKubernetesAKSAzureAWSGCPAWS GlueAWS LambdaMLflowDVCPrometheusGrafanaELK StackCloudWatchDBTOpenAI GPT-4LangChainR ShinyTableauPower BIQuickSightSSRSFlaskAI Foundry
- Developed automated workflows for model retraining and deployment triggered by data drift and performance metrics using Airflow and Kubeflow Pipelines.
- Designed and deployed end-to-end ML pipelines using BigQuery ML for in-database training and inference on TB+ datasets.
- Built and maintained containerized ML applications using Docker and deployed on Kubernetes/AKS with Argo Workflows for orchestration.
- Implemented monitoring and logging for models and infrastructure using Prometheus, Grafana, ELK Stack and Azure Monitor.
- Created forecasting and price optimization solutions (SARIMA/ARIMA) and integrated R Shiny dashboards for financial planning and pricing, delivering measurable business impact (retail price optimization noted as +10% profitability).
- Developed GenAI chatbots and RAG-based knowledge search using OpenAI GPT-4 and LangChain; integrated copilots into Power BI with Microsoft Copilot Studio.
- Optimized ETL and data pipelines in Snowflake, BigQuery and with AWS Glue; automated ingestion using Lambda and serverless patterns.
- Collaborated with cross-functional teams to implement MDM processes and integrate analytics with CRM and ERP systems (Salesforce, Microsoft Dynamics 365).
- Automated marketplace health metrics pipeline using SQL and dbt and improved BI query performance using Redshift and query optimization.
Senior Data Scientist
Apple (Sunnyvale)
Built NLP and data engineering solutions to identify and redact personal health information (PHI) from unstructured customer care data, exposure of PII detection APIs, and operational reporting improvements including Power BI and Palantir Foundry integrations.
BERTXGBoostPythonSQLPower BIPalantir FoundryETL toolsMicrosoft Dynamics 365
- Implemented BERT-based NER models to detect and redact PHI from raw unstructured customer care data and exposed APIs for detection.
- Collaborated with data engineers to design ETL processes and wrote optimized SQL for data extraction and analytical needs.
- Built and validated NLP models and classical ML models (XGBoost) for information extraction and classification tasks.
- Performed univariate and multivariate analysis to uncover patterns and inform model features.
- Designed Power BI dashboards integrated with Palantir Foundry for real-time supply chain and operations analytics.
- Worked with business teams to customize Microsoft Dynamics 365 reporting and improve CRM reporting capabilities.
Projects
IntuitAssist (IAQB) Tableau Dashboard
Sr AI Data Scientist/MLops Engineer
Maintained and enhanced the IntuitAssist Tableau dashboard, integrating analytics for live product teams and enabling experiment review and cohort analysis.
TableauSQLBigQueryDBT
- Dashboard maintenance and enhancement
- A/B experiment reporting integration
- Cohort and GN S analysis
- Performance visualization for WinRoom experiment reviews
OIPRO A/B Experiment Framework and Dashboard
Lead Developer
Built the A/B testing experiment from scratch, instrumented results tracking and developed a Tableau dashboard for stakeholders to review experiment performance.
SQLTableauPython
- Experiment instrumentation
- Automated result aggregation
- Interactive experiment performance dashboards
Automated Model Retraining & Deployment Pipelines
MLOps Engineer
Automated end-to-end model retraining and deployment triggered by drift and performance metrics using Airflow, Kubeflow and Argo Workflows; integrated monitoring and CI/CD.
Apache AirflowKubeflow PipelinesArgo WorkflowsMLflowDVCKubernetesDocker
- Automated retraining triggers
- Model versioning and reproducibility
- CI/CD integration for models
- Production monitoring and alerting
Generative Reports Engine
Developer / Designer
Designed a generative reporting engine that automates creation of financial summaries using Azure-hosted LLMs and structured tabular data.
AzureOpenAI GPT-4PythonPower BI
- LLM integration with structured data
- Automated report generation
- Power BI integration with Copilot style interfaces
Mortgage Document Processing (AI Foundry)
Data Scientist
Built document processing pipelines using AI Foundry Automation Studio to classify, extract entities and OCR mortgage documents for downstream analytics.
AI FoundryOCRPython
- Document OCR and classification
- Entity extraction
- Pipeline automation in AI Foundry
PHI Redaction with BERT (Apple)
Senior Data Scientist
Implemented BERT-based NER models to identify and redact personal health information and sensitive customer data from unstructured support interactions; exposed APIs for detection and redaction.
BERTXGBoostPythonAPIsSQL
- NER model training and evaluation
- API development for detection and redaction
- Data extraction and ETL for model inputs