Lead Data Engineer at Remote, Remote, USA |
Email: [email protected] |
From: ashok, hclglobal [email protected] Reply to: [email protected] Lead Data Engineer Location: Remote ,need to support PST zone Note:H1B,USC,H4EAD,L2(Passport number Mandatory) Note:Retail domain Mandatory Overall Experience level: 12+ years in IT with min 8+ years of Data Engineering and Analyst experience. Mandatory Areas Must have skills. Spark, Pyspark, Python, Kubernetes, Docker, SQL, GCP, Big Data experienceb Optional Skill : Kubernetes,Hadoop,Sql Mandatory if Applicable Domain Experience (If any ) Retail Job Description: Assembling large to complex sets of data that meet non-functional and functional business requirements Identifying, designing and implementing internal process improvements including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes Building required infrastructure for optimal extraction, transformation and loading of data from various data sources using GCP/Azure and SQL technologies Building analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics including operational efficiency and customer acquisition Working with stakeholders including data, design, product and executive teams and assisting them with data-related technical issues Working with stakeholders including the Executive, Product, Data and Design teams to support their data infrastructure needs while assisting with data-related technical issues Strong background in data warehouse design Overseeing the integration of new technologies and initiatives into data standards and structures Strong Knowledge in Spark, PySpark, SQL, PL/SQL (Procedures, Function, Triggers, Packages and fixing the problems.) Experience in Cloud platform(GCP/Azure) data migration Source/Sink mapping, Build pipelines, work flow implementation, ETL and data validation processing Strong verbal and written communication skills to effectively share findings with shareholders Experience in Data Analytics, optimization, machine learning techniques or Python is added advantage Good understanding of web-based application development tech stacks like Java, AngularJs, NodeJs is a plus Key Responsibilities 20% Requirements and design 60% coding & testing and 10% review coding done by developers, analyse and help to solve problems 5% deployments and release planning 5% customer relations You bring: Bachelors degree in Computer Science, Computer Engineering or a software related discipline. A Masters degree in a related field is an added plus 6 + years of experience in Data Warehouse and Hadoop/Big Data 3+ years of experience in strategic data planning, standards, procedures, and governance 4+ years of hands-on experience in Python or Scala 4+ years of experience in writing and tuning SQLs, Spark queries 3+ years of experience working as a member of an Agile team Experience with Kubernetes and containers is a plus Experience in understanding and managing Hadoop Log Files. Experience in understanding Hadoop multiple data processing engines such as interactive SQL, real time streaming, data science and batch processing to handle data stored in a single platform in Yarn. Experience in Data Analysis, Data Cleaning (Scrubbing), Data Validation and Verification, Data Conversion, Data Migrations and Data Mining. Experience in all the phases of Data warehouse life cycle involving Requirement Analysis, Design, Coding, Testing, and Deployment., ETL Flow Experience in architecting, designing, installation, configuration and management of Apache Hadoop Clusters Experience in analyzing data in HDFS through Map Reduce, Hive and Pig Experience building and optimizing big data data pipelines, architectures and data sets. Strong analytic skills related to working with unstructured datasets Experience in Migrating Big Data Workloads Experience with data pipeline and workflow management tools: Airflow Experience with scripting languages: Python, Scala, etc. Cloud Administration For this role, we value: The ability to adapt quickly to a fast-paced environment Excellent written and oral communication skills A critical thinker that challenges assumptions and seeks new ideas Proactive sharing of accomplishments, knowledge, lessons, and updates across the organization Experience designing, building, testing and releasing software solutions in a complex, large organization Demonstrated functional and technical leadership Demonstrated analytical and problem-solving skills (ability to identify, formulate, and solve engineering problems) Keywords: information technology procedural language Lead Data Engineer [email protected] |
[email protected] View All |
12:41 AM 09-Jan-25 |