Home

Diwita Banerjee - AI / ML Enginner
[email protected]
Location: Fairfax, Virginia, USA
Relocation:
Visa:
Resume file: Diwita_Banerjee_AI_ML_1756489196207.pdf
Please check the file(s) for viruses. Files are checked manually and then made available for download.
Diwita Banerjee
AI/ML Engineer
+17035856489 | [email protected] | LinkedIn | GitHub | Open to Relocate
PROFESSIONAL SUMMARY
AI/ML Engineer with 5+ years of experience designing and deploying production-grade machine learning solutions in financial
services, healthcare, and enterprise domains. Skilled in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG),
Natural Language Processing (NLP), and MLOps. Experienced in fine-tuning Transformer models, building scalable APIs, and
deploying AI systems on AWS, GCP, and Azure. Delivered LLM-powered pipelines that improved document retrieval accuracy by
30% and reduced inference latency by 40%. Automated ML workflows with Airflow, MLflow, Jenkins, and Docker, cutting
deployment cycles by 50%. Strong background in regulatory-compliant AI, feature engineering, and real-time analytics. Adept at
collaborating with cross-functional teams to drive business impact through AI adoption.
TECHNICAL SKILLS
Languages & IDEs: Python, SQL, Shell Scripting, MATLAB, Jupyter Notebook, VS Code, Google Colab, SSMS
Frameworks & Libraries: Pandas, NumPy, Scikit-learn, TensorFlow, Keras, PyTorch, XGBoost, Matplotlib, Seaborn, NLTK,
spaCy, Sentence-BERT, Hugging Face Transformers, LangChain
Machine Learning: Logistic Regression, Linear Regression, Decision Trees, Random Forests, SVM, Naive Bayes, A/B
Testing, Model Evaluation, Feature Engineering
Deep Learning & NLP/LLMs: CNN, RNN, LSTM, BERT, RoBERTa, GPT-3/4, LLMs, Prompt Engineering, RAG pipelines,
Fine-tuning (LoRA/PEFT)
MLOps & Deployment: Docker, Kubernetes (EKS), FastAPI, Flask, MLflow, Jenkins, Terraform, Airflow, PySpark, Model Drift
Detection, Usage Metrics Dashboards
Cloud & Visualisation: AWS (Lambda, EC2, S3, RDS, SageMaker, SQS, SNS, CodeDeploy, CloudWatch, API Gateway),
GCP, Azure, Tableau, Power BI
Databases & Tools: PostgreSQL, MySQL, MongoDB, Cassandra, Redis, Neo4j, SQL Server, SQLAlchemy
Statistical Techniques: Hypothesis Testing, Data Modeling, Data Visualization, Experiment Design, A/B Testing
Collaboration Tools: Agile Development, Requirements Gathering, Stakeholder Engagement, AI Insights Visualization
PROFESSIONAL EXPERIENCE
Freddie Mac | AI/ML Engineer
Jun 2024 Present
Designed and implemented LLM-powered RAG pipelines using FAISS and Hugging Face, increasing financial document
retrieval accuracy by 30%.
Built and deployed FastAPI-based microservices on AWS EKS and Lambda, reducing model inference response time by 40%
and improving scalability.
Fine-tuned Transformer models, including BERT and RoBERTa, on compliance-related datasets, improving contextual
relevance by 25%.
Automated end-to-end ML lifecycle management using Airflow, MLflow, Jenkins, and Docker, shortening deployment cycles by
50%.
Developed monitoring dashboards with PySpark and AWS CloudWatch for drift detection and performance analysis of
deployed models.
Collaborated with quantitative analysts and compliance officers to deliver regulator-compliant AI solutions with transparent
auditability.
George Mason University | AI Research Assistant
March 2024 Nov 2024
Built fairness and bias evaluation pipelines for multimodal AI models, including Stable Diffusion, LLaVA, and InstructBLIP,
uncovering demographic disparities.
Developed a medical VQA system using MIMIC-IV EHR data that reduced clinician query resolution time by 25% in controlled
trials.
Optimised large-scale training and inference workflows on the Hopper HPC cluster, reducing runtime by 40%.
Conducted federated multimodal experiments for privacy-preserving healthcare AI and improved model generalization in
clinical settings.
Benchmarked multiple Vision-Language Models (VLMs) and contributed results to academic publications.
Co-authored ongoing research papers in responsible and federated healthcare AI.
Cognizant Technology Solutions | Machine Learning Developer, Salesforce CRM
Nov 2021 Jul 2023
Built predictive lead/case/renewal scoring models with Einstein Prediction Builder and automated nightly training and scoring
via Apex batch and queueable jobs.
Delivered prescriptive recommendations using Einstein Discovery and Next Best Action, boosting SLA compliance and agent
productivity.
Develop CRM Analytics datasets and dashboards to monitor win-rate lift, adoption, precision/recall, and model drift, with alerts
for threshold deviations.
Integrated external ML endpoints through MuleSoft and Named Credentials with retry, timeout, and audit logging, ensuring
secure and reliable data exchange.
Implemented event-driven and batch pipelines with Platform Events and REST APIs for near real-time updates and
high-volume processing.
Automated CI/CD pipelines with Git and Copado, added unit/integration tests for high coverage, and authored runbooks to
streamline monitoring and rollback.
Recognized with the Working as One (2023) and Always Striving & Never Settling (2022) awards.
Larsen & Toubro Infotech | AI Developer Full Stack Applications
Jun 2020 Oct 2021
Engineered AI-driven predictive analytics modules using Spring Boot, Python, and Oracle SQL, reducing backend response
latency by 39%.
Developed intelligent data-driven UI components with React.js, improving usability and long-term scalability.
Automated defect root cause analysis workflows, reducing issue resolution time by 30%.
Designed APIs for scalable integration of AI models into enterprise applications.
Collaborated with business stakeholders to align AI feature development with operational requirements.
Integrated backend analytics with enterprise dashboards for real-time monitoring and decision-making.
RESEARCH & PROJECTS
MedBLIP Medical Image Captioning (GMU, Spring 2025)
Fine-tuned BLIP on the ROCO dataset for medical imaging captioning tasks.
Achieved benchmark improvements across CIDEr, SPICE, and BERTScore.
Compared performance with BLIP-2, InstructBLIP, and Gemini models.
CloudMart Multicloud AI E-Commerce Platform (Bootcamp, Mar 2025)
Designed and deployed a GPT-powered shopping assistant across AWS, GCP, and Azure using Terraform, Kubernetes, and
Docker.
Implemented real-time product discovery, intelligent Q&A, and analytics pipelines.
Built cross-cloud ETL-style AI pipelines with Git-based CI/CD, improving scalability and observability.
News Summarisation & Simplification (GMU NLP Project, Fall 2024)
Built an LSTM-based summarizer and T5 simplification model using Hugging Face Transformers and TensorFlow.
Trained on the WikiAuto dataset and achieved BLEU 0.22 and Flesch-Kincaid readability 8.31.
Delivered an end-to-end NLP pipeline to improve accessibility and comprehension of complex news content.
Movie Recommendation (IJARESM, 2022)
Developed a BERT-based semantic search and recommendation engine on the CMU Movie Summaries dataset.
Enhanced semantic retrieval and recommendations with a 15% improvement in accuracy.
Published results in the IJARESM research journal.
EDUCATION
Master of Science in Computer Science (Machine Learning Concentration)
George Mason University, Fairfax, VA
Coursework: Algorithms, AI, Data Mining, Machine Learning, Deep Learning, Advanced NLP
Aug 2023 May 2025
Honours & Activities: Led and Won 2nd Position in GDG-GMU Hackathon; research on RCRS for Prompt Injection
Mitigation
Keywords: continuous integration continuous deployment artificial intelligence machine learning user interface javascript business intelligence sthree Colorado Virginia

To remove this resume please click here or send an email from [email protected] to [email protected] with subject as "delete" (without inverted commas)
[email protected];6060
Enter the captcha code and we will send and email at [email protected]
with a link to edit / delete this resume
Captcha Image: