Home

Gaganavarshi kanumuru - Data Engineer
[email protected]
Location: Philadelphia, Pennsylvania, USA
Relocation: Open to relocation
Visa: H1B
Resume file: GAGANA_KANUMURU_RESUME_1759957035461.pdf
Please check the file(s) for viruses. Files are checked manually and then made available for download.
GAGANAVARSHI KANUMURU
+14844472912 | [email protected] | www.linkedin.com/in/gaganakanumuru

TECHNICAL SKILLS
Languages: Python, Java, SQL (MySQL, PostgreSQL, SQL Server, PL/SQL), C/C++, JavaScript, Bash/Shell, HTML, CSS,
Bootstrap, React.js
Data Engineering & ETL: Data pipeline automation, API integration, ETL design, Web scraping (Selenium, BeautifulSoup), Data
Cleaning & Transformation, Batch & Stream processing, Workflow Scheduling (Airflow), dbt, Data Modeling, Warehousing
(Redshift, Snowflake, Databricks, SSIS-basic)
Analytics & Machine Learning: scikit-learn, PyTorch, OpenAI API, Hugging Face Transformers, Time Series Forecasting, Power
BI, Tableau (basic), Excel (pivot tables, VLOOKUP)
Backend & Frameworks: FastAPI, Django, Flask, REST, SOAP, JSON, Spring Boot, Microservices, Distributed Systems
Cloud Platforms: AWS (S3, Redshift, Lambda, QuickSight, Athena), Azure, GCP (BigQuery)
DevOps & Automation: Git, GitHub, GitHub Apps, CI/CD pipelines, Docker (basic), VS Code, PyCharm, Eclipse, Visual Studio
Systems & Device Management: Intune, JAMF, JIRA, Confluence, Active Directory, SCCM
Expertise Areas: Scalable ETL, Cloud Data Integration, API automation, Backend development, Process automation, Business
Intelligence
Soft Skills: Stakeholder collaboration, Team leadership, Training, Technical documentation, Strong analytical mindset

EXPERIENCE
ATC Analytics, Portland, OR Sep 2024 Present
Data Engineer
Automated and optimized backend data pipelines for 80K+ companies using Python, SQL, and GitHub APIs, resulting in 30% faster data processing and more accurate reporting for business intelligence use cases.
Developed validation frameworks for manual and automated testing of 5,000+ company matches, achieving 95%+ mapping accuracy for official GitHub organization assignments.
Enhanced search query automation tools for GitHub and Hugging Face, leveraging API integration and Python scripting to accelerate early-stage research and prototype delivery.
Implemented token rotation, request throttling, and error handling for scalable, resilient data scraping and collection processes, ensuring compliance with strict API rate limits.
Authored technical documentation, user guides, and process flowcharts, supporting effective knowledge transfer and fast onboarding for new team members.

Central Michigan University, Mount Pleasant, MI Jan 2023 May 2024
Technical Research Assistant
Provided direct hands-on technical support for over 300+ faculty, students, and staff, resolving hardware, software, network,
and system issues on Linux, Windows, and Mac environments.
Streamlined IT ticketing and root cause analysis processes, increasing issue resolution speed by 20% and enhancing cross-team
collaboration for escalated or complex technical cases.

DXC Technology Jun 2021 Dec 2022
Associate Professional Software Engineer
Automated insurance document processing and claims workflows using Java, Oracle Documaker, and custom scripts, eliminating repetitive manual entry and increasing process speed by 75%.
Designed and optimized data extraction and transformation scripts, improving the flow of customer and policy data into downstream systems and enabling faster reporting and auditing.
Collaborated with cross-functional teams in Agile sprints to analyze data processing issues, implement fixes, and improve overall data quality and consistency, leading to a 15% boost in client satisfaction.

PROJECTS
NLP-Powered Research Companion Chatbot
Developed a Dialogflow and Python-based chatbot to help students efficiently search and retrieve academic papers, integrating web scraping and keyword ranking logic to deliver the most relevant results.
Built and deployed a 24/7-accessible web interface (HTML, CSS, Netlify) and maintained user documentation for a seamless student research experience.

Solar Energy Forecasting with Machine Learning
Built and compared models (ANN, Moving Average, Fuzzy Logic) to predict solar energy using real PV output data.
Focused on modular design, clean evaluation flow, and reliable forecasting using Python and sklearn.
Keywords: cprogramm cplusplus continuous integration continuous deployment javascript business intelligence sthree information technology procedural language Michigan
Keywords: cprogramm cplusplus continuous integration continuous deployment javascript business intelligence sthree information technology procedural language Michigan

To remove this resume please click here or send an email from [email protected] to [email protected] with subject as "delete" (without inverted commas)
[email protected];6232
Enter the captcha code and we will send and email at [email protected]
with a link to edit / delete this resume
Captcha Image: