Nidhin Chandran Sreedevi Ramachandran - DevOps Engineer/Linux/Clould |
[email protected] |
Location: Palo Alto, California, USA |
Relocation: yes |
Visa: H1B |
Nidhin Chandran S R
Senior DevOps Engineer | Cloud Infrastructure Specialist | Linux Engineer Phone: +1 650519501 Location: Palo Alto, California Email: [email protected] linkedin.com/in/nidhin-chandran PROFESSIONAL SUMMARY Senior DevOps Engineer with over 15 years of IT experience, including 10 years specializing in AWS Cloud, Bare Metal Virtualization and CI/CD pipeline development. Proven expertise in cloud automation, containerization (Docker, Kubernetes), and multi-cloud operations. Skilled at Linux server optimization, scalable CI/CD implementation, and supporting ML/AI workflows using Kubeflow, MLflow and AWS SageMaker. Cloud Expertise: Deep knowledge of AWS services (EC2, S3, RDS, Route 53, IAM, Lambda, ECS, EKS) and GCP for building scalable, secure, and high-availability cloud solutions. Proficient in managing multi-cloud infrastructure using Terraform, CloudFormation and Ansible ensuring consistency and repeatability in deployments. Containerized Deployments: Automated the deployment of containerized applications using Docker and orchestrated services with Kubernetes ensuring scalability fault tolerance and minimal downtime. Adept at optimizing Linux server performance through kernel tuning resource management and advanced monitoring tools like Nagios, Prometheus and Grafana. Automated CI/CD Pipelines: Designed and implemented fully automated CI/CD pipelines using tools like Jenkins, GitHub, GitLab CI/CD and AWS CodePipeline. Experience integrating cloud and on-premises systems using Dell Boomi AtomSphere for seamless data flow and analytics. Proficient in deploying and managing virtualized environments using VMware and Proxmox virtualization platforms. Set up Kubeflow pipelines on Kubernetes to automate ML workflows, including data preprocessing, model training using TensorFlow and PyTorch and batch inference. Skilled in designing and configuring Virtual Data Centers in AWS Cloud, including VPC, Public and Private Subnets, Security Groups, Route Tables, and Elastic Load Balancers (ELBs) to support enterprise level data warehousing. Hands-on experience with Bash scripting and Ansible Playbooks for server automation, package management and security patching. Expertise in maintaining secure server-client communication using OpenSSL and Certbot,WinAcme certificates for certificate management Implemented a fully automated CI/CD pipeline using Jenkins to build Dockerized applications, perform static code analysis with SonarQube, and publish artifacts to Nexus Repository. Leveraged Kubernetes (EKS) for orchestrating containerized workloads. Skilled in deploying Java, PHP and Node.js applications with a focus on performance optimization and security enhancements across web servers such as HTTPD, Apache, Tomcat and Nginx. Trained and mentored L1, L2, and L3 support teams enhancing their advanced troubleshooting capabilities. CERTIFICATIONS RHCE : RedHat Certified Engineer MCSA : Microsoft Certified Solution Associate MS-700 : Microsoft Certified Teams Administrator Associate Ribbon SBC Edge Technical : SBCE11 AWS Certified Solution Architect Databricks Certified Generative AI Engineer Associate TECHNICAL SKILLS Cloud Platforms AWS (EC2,VPC,RDS,IAM, CloudFormation, S3, ELB, Auto Scaling, CloudFront, Route 53, CloudWatch, ,AWS SageMaker, EBS),Azure, OVH (Dedicated Cloud Hosting),VMware DevOps Tools | CI/CD Jenkins,Ansible,Terraform,GitHub,Azure Pipelines,AWS Code Pipeline,Docker, Kubernetes,SonarQube,Nexus, Python, Bash Shell Scripting, Git,GitHub,GitLab, Bitbucket,Azure Repos Monitoring and Security Nagios,Prometheus,Grafana,Splunk,OSSEC,Lynis,Wireshark,Solar-Winds Databases MySQL,PostgreSQL,RDS,Cassandra,DynamoDB,Redis,Redshift Operating Systems Linux(RHEL,Ubuntu,Suse,Debian,CentOS),Windows Server 2008R2,2012,2016 Network Protocols SSH, DNS, DHCP, HTTP/HTTPS, FTP, LDAP, Samba, NFS PROFESSIONAL EXPEREINCE WPP, San Francisco, CA Sep 2023 Present Role: Sr. DevOps Engineer Description: My role was pivotal in automating CI/CD pipelines, managing hybrid cloud infrastructures, optimizing containerized microservices, and implementing advanced monitoring and security solutions. By driving infrastructure automation and enabling seamless deployments, I have reduced deployment times by 40%, minimized downtime to 99.99% availability, and improved operational efficiency delivering measurable cost savings of $3 million annually. Responsibilities Designed and managed scalable and highly available cloud infrastructures on AWS using services like EC2, VPC, RDS, IAM, S3, and CloudFormation. Engineered fully automated CI/CD pipelines using Jenkins, SonarQube and Nexus, reducing deployment times by 40%. Implemented centralized logging and monitoring systems using ELK Stack (Elasticsearch, Logstash, Kibana) and AWS CloudWatch, improving system observability, real-time alerting. Integrated AWS SageMaker for training and deploying machine learning models with TensorFlow and PyTorch. Engineered high-availability and disaster recovery solutions using Pacemaker, Corosync, and Rsync, ensuring near-zero downtime for critical services. Collaborate with data science and engineering teams to design and implement optimized workflows for ML/AI workloads. Automated VM provisioning and configuration in Proxmox using Ansible, ensuring consistent environments for data scientists and developers. Demonstrates a strong willingness to adapt to new technologies and embraces self-learning to efficiently deliver high-quality solutions in evolving technical environments. Work on log processing pipelines with tools like Kafka and Logstash to ensure reliable ingestion and search indexing Collaborated with stakeholders to gather requirements and translate business needs into scalable cloud solutions. Integrated automated testing and deployment workflows to ensure seamless application delivery and high-quality releases. Experience with Kubeflow for building, orchestrating, and deploying ML workflows on Kubernetes. Deployed and orchestrated containerized applications using Docker and Kubernetes, implementing dynamic scaling, rolling updates, and auto-healing capabilities to ensure high availability and reliability of services. Manage and optimize virtualized environments using Proxmox, VMware vSphere, and KVM/QEMU, ensuring resource allocation aligns with organizational needs. Configured and installed SSH (Secure Shell) encryption for secure access on Ubuntu and Red Hat Linux Environment/Tools: Git,GitHub,Jenkins,Maven,SonarQube,Nexus,Kubernetes,EKS,AWS Fargate,AWS ECS, Prometheus,Grafana,ELK Stack,Terraform,AWS SageMaker,PyTorch COMCAST, Philadelphia Aug 2022 Aug 2023 Role: DevOps Engineer Responsibilities Deployed and managed LXC containers on Proxmox for lightweight virtualization, streamlining resource provisioning and streamlining resource provisioning and improving the efficiency of DevOps workflows through reduced overhead and faster container lifecycle management. Configured IAM roles, accounts and policies to ensure secure and controlled access across AWS resource. Managed Docker images in Nexus Artifactory for efficient container lifecycle management and deployment workflows. Set up monitoring, alarms and real-time notifications using AWS CloudWatch, Nagios, and the ELK Stack (Elasticsearch, Logstash, Kibana), enabling proactive infrastructure monitoring. Implemented AWS solutions using EC2,S3,RDS,IAM,Elastic Load Balancers (ELB),Auto Scaling Groups and Route 53 for DNS management, ensuring scalable, secure and highly available Infrastructure. Performed blue-green and canary deployments to enable seamless application updates while ensuring high reliability and minimal downtime. Deployed and optimized databases like MySQL, PostgreSQL, and MongoDB Clustering in virtualized and containerized environments. Configured automated database snapshots using AWS Snapshots and Percona XtraBackup, ensuring consistent backups and migration without affecting performance. Sends log files to Amazon S3 buckets for long-term storage and compliance purposes Designed CloudFormation and Terraform templates to provision AWS infrastructure-as-code including networking, compute, storage, security, and monitoring. Developed and maintained end-to-end CI/CD pipelines using Jenkins, automating code integration, testing, and deployment processes. Environment/Tools: Linux,AWS,Docker, Kubernetes, Jenkins, Nexus Artifactory, Nagios, ELK stack, Ansible, GitLab, Route53, IAM, CloudWatch, S3, EC2, Security Groups, TCP/IP, DNS, Shell/Bash Scripting HTC Global, India Aug 2020 Aug 2022 Role: SRE|Devops Engineer Responsibilities Engineered comprehensive monitoring and observability solutions utilizing Prometheus, Grafana, and Datadog; established alerts that led to a 40% decrease in incident response times and improved system reliability metrics by 25%. Integrated monitoring tools with incident management systems such as PagerDuty, reducing mean time to resolution (MTTR) by streamlining alert and escalation workflows. Developed and maintained Ansible playbooks to automate server provisioning and configuration processes, ensuring consistent and efficient deployments across environments. Integrated Rundeck for task orchestration and self-service automation. Actively participated in post-incident reviews and root cause analysis (RCA), documenting findings and implementing preventive measures to improve system reliability. Automated incident response and ticket resolution processes using custom scripts, improving operational efficiency and reducing manual intervention. Regularly audited and reviewed monitoring systems to align with evolving business requirements and system needs. Orchestrated end-to-end data pipelines, integrating Kafka with splunk streaming, to facilitate seamless data flow from producers to consumers. Set up monitoring dashboards using tools like Grafana and Kafka Manager, providing real-time insights into stream health, throughput, and latency. Environment/Tools: Prometheus, Grafana, Datadog, PagerDuty, New Relic, AppDynamics, Dynatrace, ELK Stack, Splunk, JMeter, Gatling, Locust, Apache Bench, CloudWatch, Ansible, RCA documentation. GBS PLUS PVT LTD, India Apr 2018 Jul 2020 Role: Linux | Devops Engineer Responsibilities Installed, upgraded, and managed packages on Red Hat Linux and Debian servers using YUM,RPM and APT tools. Linux server hardening strategies by configuring Fail2ban to prevent brute-force attacks, enforcing access controls with SELinux, AppArmor and securing network traffic using Firewalld and iptables. Installed, configured, and maintained application servers like WebSphere and WebLogic, as well as web servers including Apache, HTTPD, and Tomcat on Linux and UNIX platforms. Managed domain mapping and SSL certificate integration, including free SSL solutions like Certbot, and handled domain management using cPanel and providers like GoDaddy. CI/CD pipelines using Jenkins to automate build, test, and deployment processes. Configured and installed SSH (Secure Shell) encryption for secure access on Ubuntu and Red Hat Linux systems. Conduct vulnerability assessments using Nessus,Qualys and Lynis and mitigate risks by applying patches,upgrades. Manage storage solutions using LVM, HDFS, Ceph, GlusterFS, and distributed storage for scalable data management. Created and maintained VM images for VMware vSphere, RedHat OpenShift,VMware Workstation. Perform advanced kernel tuning, resource allocation, and performance optimization for large-scale deployments. Automated backups using Crontab based on client-specific requirements Set up password-less authentication and agent forwarding for secure server access using ssh-keygen. Expertise in Linux system security hardening, vulnerability assessments, and ensuring compliance with industry standards like PCI DSS and GDPR Environment/Tools: Windows Server 2012, CentOS 8.x, Red Hat Enterprise Linux 9, YUM, RPM, Bash, Shell, HTML, Firewall, Apache, Tomcat, LDAP, NFS, Samba, SSH, DHCP, DNS, Kickstart, TCP/IP, WebSphere, WebLogic, Nagios Techvantage Systems, India Nov 2010 Mar 2018 Role: Sr.System Administrator Responsibilities Installed, configured, and maintained Windows Server environments, ensuring stable and secure operations. Performed server patching and updates using Windows Server Update Services (WSUS) and automated tools to maintain compliance and reduce vulnerabilities. Managed Active Directory (AD), including creating and managing users, groups, and organizational units (OUs) to maintain efficient directory structures Configured role-based access control (RBAC) to enforce least privilege access policies across user accounts, enhancing security. Implemented and maintained Group Policy Objects (GPOs) to enforce organizational security and compliance requirements, such as password policies and application restrictions Deployed and maintained System Center Configuration Manager (SCCM) for operating system imaging, application deployment, and patch management, streamlining endpoint management. Designed and managed Active Directory (AD) infrastructure, including domains, forests, and trust relationships, ensuring seamless authentication and resource access across the organization. Created and maintained standardized system images using Sysprep for consistent deployment across diverse hardware and virtual environments. Environment/Tools:Windows Server 2008 R2, Windows Server 2012, CentOS, Ubuntu Server, VMware ESXi, Task Manager, Resource Monitor, Performance Monitor, YUM, Hyper-V. Education Bachelor of Science in Computer Science University of Kerala, Kerala 2006 2009 Master of Science in Computer Applications University of Kerala, Kerala 2009 2012 Keywords: continuous integration continuous deployment artificial intelligence machine learning javascript sthree active directory rlang information technology microsoft California |