Retail Product Performance Dashboards
→
Summary
Developed comprehensive dashboards and analytical models for retail product performance and causal inference.
Highly accomplished Analytics Professional with over 4 years of experience specializing in Risk Management and data-driven insights. Proven expertise in Python, SQL, and advanced ML algorithms, excelling in data cleaning, exploratory data analysis, feature engineering, and cutting-edge NLP techniques. Adept at unearthing actionable insights to drive strategic decision-making and organizational growth. Seeking a dynamic role to apply a strong foundation in Applied Statistics and AI/ML, delivering significant business impact.
Data Scientist & AI/ML Consultant
→
Summary
Provided expert data science and AI/ML consulting services to various organizations, focusing on leveraging advanced analytics for actionable business insights.
Highlights
Developed and deployed an AI-powered Insurance Policy Insights & Comparison System on Hugging Face using Gradio, implementing RAG with Mistral 7B and FAISS for efficient, semantic comparison and extraction of information from diverse policy documents.
Delivered actionable business insights for a global ride-sharing company by leveraging BigQuery, BigQueryML, and Looker, optimizing operational strategies.
Completed intensive, industry-aligned programs to advance expertise in Machine Learning, Natural Language Processing, and foundational Large Language Models (LLMs).
Currently developing an NLP system for scene-wise dialogue extraction and character development analysis, utilizing sentiment and emotion shifts from theatrical works (e.g., Tamasha).
Data Scientist - Digital Marketing Analytics
→
Summary
Spearheaded advanced analytics initiatives to optimize digital marketing performance and drive revenue growth for diverse clients.
Highlights
Spearheaded the design and implementation of a multiclass CatBoost model, integrating advanced embeddings to accurately classify clothing purchase categories, enhancing targeting precision for over 500M+ customers.
Achieved 91% AUC by building a robust model with iterative feature selection, effectively managing concept drift, and integrating an attention layer with embeddings, leading to precise targeting for 500M+ customers across class-wise ventiles.
Generated INR 300K in incremental monthly revenue for 8+ clients by deploying highly accurate model predictions, directly impacting business growth.
Pioneered data-driven digital marketing solutions tailored for FMCG, Real Estate, and Telecom sectors.
Collaborated cross-functionally with clients to develop and execute ML-based audience strategies, optimizing campaigns through A/B testing, which resulted in a significant INR 3M+ increase in monthly net revenue and improved Return on Ad Spend (ROAS).
Data Scientist - Acquisition Risk Analytics
→
Summary
Led critical data science initiatives focused on mitigating offer abuse and enhancing profitability within acquisition processes.
Highlights
Led a critical enterprise-wide initiative to combat offer abuse, significantly impacting 10% of 2023 acquisitions and enhancing overall profitability.
Engineered a novel, personalized offer data-driven definition to detect 1.3X more intentional offer exploiters, replacing a 4-year-old legacy system and accelerating identification.
Revitalized predictive gamer models by redefining offer exploitation flags and integrating clickstream signals, achieving an 8% Year-over-Year improvement in AUC.
Drove cross-functional collaboration with marketing, finance, and product teams, managing rapid iterations to implement ROI-driven model cutoffs across 100+ cohorts, which enhanced policy controls and generated $48.2M in savings in 2023.
Developed and deployed a comprehensive MIS dashboard for daily application and gaming impact tracking, resulting in 20+ hours of monthly time savings.
Successfully implemented digital signal analysis to detect offer abuse, leading to $4.7M in direct savings.
Designed and validated 13 behavioral indicators from diverse digital sources (pages, URLs, IPs, devices) to identify gaming tendencies through rigorous hypothesis testing.
Leveraged TF-IDF and word embeddings to identify high-risk pages, then developed a robust Random Forest classifier for detection.
Processed JSON data to generate over 600 variables, employing XGBoost to pinpoint the top 30 critical gaming features.
Constructed a neural network leveraging 43 digital signals to accurately predict digital gaming probability for new acquisitions.
Analytics Engineer
→
Summary
Delivered end-to-end cloud-based analytics solutions to optimize supply chain and operational efficiency for a leading CPG company.
Highlights
Designed and delivered an end-to-end cloud-based analytics solution for a leading US/UK CPG company, leveraging Google Cloud Platform (GCP) and Looker to enhance operational efficiency.
Developed an ML model on GCP to accurately predict order lead times, providing quantifiable financial insights to inform strategic decision-making.
Automated ML predictions using Airflow DAGs to provide daily estimates of delivery probability, inventory levels, and stockout losses with 50% and 95% confidence intervals, improving supply chain foresight.
Identified and mitigated $5-6 million in 120-day risk-adjusted revenue losses attributed to stockouts.
Created dynamic Looker dashboards to visualize key performance indicators (KPIs), stockout alerts, and financial losses, significantly enhancing business visibility and accelerating data-driven decision-making.
Data Scientist Intern
→
Summary
Contributed to the development and deployment of data science solutions for a large telecommunications subscriber base.
Highlights
Strategized and executed Data Science solutions impacting 270 million subscribers across diverse business use cases, driving significant value.
Developed and deployed Data Science solutions for critical business areas including prepaid-to-postpaid upselling, churn prediction, and sentiment analysis.
Enhanced model performance by engineering features, optimizing imputation, and fine-tuning, resulting in a 1.4X improvement in churn prediction lift and 1.5X in upselling lift.
Utilized Explainable AI (XAI) to pinpoint key attrition drivers for high-risk customers, enabling the development of targeted and effective retention strategies.
→
Bachelor of Technology
Electrical Engineering
Grade: 9.13/10
→
Class XII
Science
Grade: 88.8%
→
Class X
General Studies
Grade: 10/10
Awarded By
Dphi
Achieved 9th rank in the Dphi Data Sprint 22 Data Science Hackathon.
Awarded By
HackerRank
Earned 5-star badges in Python and SQL on HackerRank, demonstrating expert proficiency.
Awarded By
Chegg
Delivered over 100 high-quality solutions as a Statistics & Probability Expert on Chegg with a 95%+ Content Feedback Score.
Awarded By
National Institute of Technology, Kurukshetra
Led the Institute Table Tennis Team for two consecutive years, securing multiple awards.
Awarded By
JEE Mains
Accomplished rank among the top 1.2% of all students (1.4 million) who appeared for JEE Mains- 2017.
Fluent
Native
Issued By
Northwestern University
Issued By
Udemy
Issued By
Udemy
Issued By
Coursera
Issued By
DataCamp
Issued By
DataCamp
Python, SQL.
Sklearn, Tensorflow, Keras, CatBoost, Random Forest, XGBoost, Descriptive Analytics, Predictive Analytics, Machine Learning, Statistics, A/B Testing, Causal Analytics, Anomaly Detection, Clustering, Model Monitoring, Recommendation Systems, Neural Networks, Hierarchical Attention Models, Ensemble Learning.
Text Representation, Word Embeddings, Topic Modelling, Attention Models, Transformer, BERT, Spacy, RobBERT, Hugging Face, RAG, Mistral 7B, FAISS, LLMs.
Google Cloud Platform (GCP), Looker, PySpark, Airflow, BigQuery, BigQueryML, FastAPI, Streamlit, PyWebIO, Gradio.
Data Cleaning, Exploratory Data Analysis (EDA), Feature Engineering, Data Pipelines, JSON Parsing.
Applied Statistics, Hypothesis Testing, Statistical Tests, Confidence Intervals.
Looker Dashboards, MIS Dashboards, KPI Tracking, Business Visibility.
Iterative Development, Agile, Cross-functional Collaboration.
Table Tennis.
Acting, Poetry.
→
Summary
Developed comprehensive dashboards and analytical models for retail product performance and causal inference.
→
Summary
Designed and implemented a robust recommendation engine using various filtering techniques.
→
Summary
Developed an NLP solution for classifying and summarizing Dutch news articles.
→
Summary
Implemented advanced analytics for user segmentation and churn prediction using ride-share data.