| Yaggadi Sreeja - Data Scientist |
| [email protected] |
| Location: Lilburn, Georgia, USA |
| Relocation: yes |
| Visa: |
| Resume file: sree Resume (1)_1772692701730.docx Please check the file(s) for viruses. Files are checked manually and then made available for download. |
|
SREEJA YAGGADI
DATA SCIENTIST Email: [email protected] | Mobile: +1(334) 781-3436 | Location: Georgia, USA SUMMARY Innovative and analytical Data Scientist with over 4years of professional experience designing, developing, and deploying data-driven, AI-powered solutions across healthcare, banking, and enterprise domains. Skilled in building end-to-end machine learning pipelines, automating ETL workflows, and integrating predictive models using advanced analytics, MLOps, and cloud technologies. Proficient in Python, R, and SQL, with expertise in statistical modelling, predictive analytics, and Big Data ecosystems including Spark, Hive, and Kafka. Adept at leveraging AWS SageMaker, Azure ML, and GCP Vertex AI to deploy and monitor scalable ML models for real-time insights. Hands-on with modern MLOps frameworks such as Docker, Kubernetes, MLflow, and Jenkins to streamline model deployment and lifecycle management. Experienced in creating interactive dashboards and KPI-driven insights using Tableau and Power BI to support data-driven business strategies. Recognized for delivering measurable results from reducing fraud losses by 20% to improving data pipeline efficiency by 35%. Passionate about advancing Generative AI, Large Language Models (LLMs), and automated analytics systems to transform data into intelligence that powers smarter, faster decisions across the organization. SKILLS Programming: Python, R, SQL, Java, Scala, C, Shell Scripting Databases: Oracle, MySQL, SQL Server, Teradata, MongoDB, Cassandra, Snowflake ETL & Data Engineering: Informatica, DataStage, Talend, Airflow, Control-M, dbt Machine Learning & AI: Regression, Classification, Clustering, Random Forest, XGBoost, Deep Learning (TensorFlow, PyTorch), NLP, Generative AI (Lang Chain, OpenAI, Hugging Face) Big Data & Cloud MLOps & Tools: Spark, Hive, Kafka, AWS (S3, Redshift, SageMaker), Azure (Data Factory, ML), GCP (Big Query, Vertex AI) Visualization: Tableau, Power BI, Seaborn, Matplotlib, ggplot2, Plotly Tools & Platforms: Jupyter, VS Code, GitHub, Databricks, Snowflake, Postman, Excel, Jira, Confluence EXPERIENCE Wellstar Health System, Marietta, Georgia, USA Jan 2025 Current Senior Data Scientist Designed and deployed predictive, NLP, and deep learning models using Python, Azure ML, TensorFlow, and PyTorch to enhance patient engagement and operational decision-making. Automated ETL, validation, and monitoring pipelines with SQL, Spark, and Airflow, improving data quality and pipeline efficiency by 35%. Developed interactive dashboards in Power BI and Tableau for executives, tracking KPIs, clinical outcomes, and financial trends. Implemented MLOps frameworks using Docker, Kubernetes, MLflow, and Jenkins for seamless model deployment and version control. Utilized Generative AI (OpenAI, Lang Chain) for summarizing patient feedback and automating report generation. Partnered with data engineering, analytics, and IT teams to operationalize ML models across departments. Mentored junior data scientists and analysts on model evaluation, statistical design, and Python best practices. DBS Bank, India Jul 2021 - Jul 2023 Data Scientist Built AI-powered fraud detection, credit risk, and customer segmentation models using Python, PySpark, XGBoost, and Scikit-learn, reducing fraud losses by 20%. Deployed automated ML pipelines on AWS SageMaker and Lambda, accelerating retraining and deployment cycles. Engineered data lakes and analytical pipelines with Hive, Spark, and Redshift for large-scale processing of structured and unstructured data. Created real-time Tableau dashboards delivering actionable insights to leadership and compliance teams. Collaborated with data governance and product teams to ensure model fairness, transparency, and regulatory compliance. Introduced A/B testing frameworks to evaluate marketing campaign effectiveness and model impact. Contributed to data strategy initiatives, improving data ingestion and automation standards organization-wide. Carrier , India Apr 2020 - Jun 2021 Data Analyst Conducted data mining, analysis, and forecasting using Python, SQL, and regression algorithms for demand and pricing optimization. Automated recurring reports and validation workflows with ETL scripts, improving reporting turnaround by 30%. Built interactive Tableau dashboards visualizing supply chain, sales, and operational KPIs. Partnered with business stakeholders to define data metrics, KPIs, and reporting standards for executive visibility. Streamlined data migration processes and improved data governance by implementing QA checkpoints and anomaly detection scripts. Collaborated with the IT team to enhance data pipelines and warehouse performance across regional systems. EDUCATION Master of Science in Computer Science (Auburn University at Montgomery, Alabama, USA) Graduated with a strong focus on Data Analytics, Machine Learning, and Cloud Computing, completing multiple hands-on projects involving predictive modelling, NLP, and big data processing. CERTIFICATIONS AWS Certified Machine Learning Specialty Microsoft Certified: Azure Data Scientist Associate (DP-100) Google Cloud Professional Data Engineer (in progress) Keywords: cprogramm quality analyst artificial intelligence machine learning business intelligence sthree rlang information technology |