| Gaganavarshi kanumuru - Data Engineer |
| [email protected] |
| Location: Philadelphia, Pennsylvania, USA |
| Relocation: Open to relocation |
| Visa: H1B |
| Resume file: GAGANA_KANUMURU_RESUME_1759957035461.pdf Please check the file(s) for viruses. Files are checked manually and then made available for download. |
|
GAGANAVARSHI KANUMURU
+14844472912 | [email protected] | www.linkedin.com/in/gaganakanumuru TECHNICAL SKILLS Languages: Python, Java, SQL (MySQL, PostgreSQL, SQL Server, PL/SQL), C/C++, JavaScript, Bash/Shell, HTML, CSS, Bootstrap, React.js Data Engineering & ETL: Data pipeline automation, API integration, ETL design, Web scraping (Selenium, BeautifulSoup), Data Cleaning & Transformation, Batch & Stream processing, Workflow Scheduling (Airflow), dbt, Data Modeling, Warehousing (Redshift, Snowflake, Databricks, SSIS-basic) Analytics & Machine Learning: scikit-learn, PyTorch, OpenAI API, Hugging Face Transformers, Time Series Forecasting, Power BI, Tableau (basic), Excel (pivot tables, VLOOKUP) Backend & Frameworks: FastAPI, Django, Flask, REST, SOAP, JSON, Spring Boot, Microservices, Distributed Systems Cloud Platforms: AWS (S3, Redshift, Lambda, QuickSight, Athena), Azure, GCP (BigQuery) DevOps & Automation: Git, GitHub, GitHub Apps, CI/CD pipelines, Docker (basic), VS Code, PyCharm, Eclipse, Visual Studio Systems & Device Management: Intune, JAMF, JIRA, Confluence, Active Directory, SCCM Expertise Areas: Scalable ETL, Cloud Data Integration, API automation, Backend development, Process automation, Business Intelligence Soft Skills: Stakeholder collaboration, Team leadership, Training, Technical documentation, Strong analytical mindset EXPERIENCE ATC Analytics, Portland, OR Sep 2024 Present Data Engineer Automated and optimized backend data pipelines for 80K+ companies using Python, SQL, and GitHub APIs, resulting in 30% faster data processing and more accurate reporting for business intelligence use cases. Developed validation frameworks for manual and automated testing of 5,000+ company matches, achieving 95%+ mapping accuracy for official GitHub organization assignments. Enhanced search query automation tools for GitHub and Hugging Face, leveraging API integration and Python scripting to accelerate early-stage research and prototype delivery. Implemented token rotation, request throttling, and error handling for scalable, resilient data scraping and collection processes, ensuring compliance with strict API rate limits. Authored technical documentation, user guides, and process flowcharts, supporting effective knowledge transfer and fast onboarding for new team members. Central Michigan University, Mount Pleasant, MI Jan 2023 May 2024 Technical Research Assistant Provided direct hands-on technical support for over 300+ faculty, students, and staff, resolving hardware, software, network, and system issues on Linux, Windows, and Mac environments. Streamlined IT ticketing and root cause analysis processes, increasing issue resolution speed by 20% and enhancing cross-team collaboration for escalated or complex technical cases. DXC Technology Jun 2021 Dec 2022 Associate Professional Software Engineer Automated insurance document processing and claims workflows using Java, Oracle Documaker, and custom scripts, eliminating repetitive manual entry and increasing process speed by 75%. Designed and optimized data extraction and transformation scripts, improving the flow of customer and policy data into downstream systems and enabling faster reporting and auditing. Collaborated with cross-functional teams in Agile sprints to analyze data processing issues, implement fixes, and improve overall data quality and consistency, leading to a 15% boost in client satisfaction. PROJECTS NLP-Powered Research Companion Chatbot Developed a Dialogflow and Python-based chatbot to help students efficiently search and retrieve academic papers, integrating web scraping and keyword ranking logic to deliver the most relevant results. Built and deployed a 24/7-accessible web interface (HTML, CSS, Netlify) and maintained user documentation for a seamless student research experience. Solar Energy Forecasting with Machine Learning Built and compared models (ANN, Moving Average, Fuzzy Logic) to predict solar energy using real PV output data. Focused on modular design, clean evaluation flow, and reliable forecasting using Python and sklearn. Keywords: cprogramm cplusplus continuous integration continuous deployment javascript business intelligence sthree information technology procedural language Michigan Keywords: cprogramm cplusplus continuous integration continuous deployment javascript business intelligence sthree information technology procedural language Michigan |