Home

Nidhin Chandran - DEVOPS | LINUX | CLOUD ENGINEER
[email protected]
Location: Menlo Park, California, USA
Relocation:
Visa:
Nidhin Chandran S R
Senior DevOps Engineer | Cloud Infrastructure Specialist | Linux Engineer | SRE
Phone: +1 (650)519-9501
Location: Palo Alto, California
Email: [email protected]

PROFESSIONAL SUMMARY

Senior DevOps Engineer with over 13 years of IT experience, including 10 years specializing in AWS Cloud, Bare Metal Virtualization and CI/CD pipeline development. Proven expertise in cloud automation, containerization (Docker, Kubernetes), and multi-cloud operations. Skilled at Linux server optimization, scalable CI/CD implementation, and supporting ML/AI workflows using Kubeflow, MLflow and AWS SageMaker.

Cloud Expertise: Deep knowledge of AWS services (EC2, S3, RDS, Route 53, IAM, Lambda, ECS, EKS) and GCP for building scalable, secure, and high-availability cloud solutions.
Proficient in managing multi-cloud infrastructure using Terraform, CloudFormation and Ansible ensuring consistency and repeatability in deployments.
Containerized Deployments: Automated the deployment of containerized applications using Docker and orchestrated services with Kubernetes ensuring scalability fault tolerance and minimal downtime.
Adept at optimizing Linux server performance metrics through kernel tuning resource management and advanced monitoring tools like Nagios, Prometheus and Grafana.
Automated CI/CD Pipelines: Designed and implemented fully automated CI/CD pipelines using tools like Jenkins, GitHub, GitLab CI/CD and AWS CodePipeline.
Experience integrating cloud and on-premises systems using Dell Boomi AtomSphere for seamless data flow and analytics.
Proficient in deploying and managing virtualized environments using VMware and Proxmox virtualization platforms.
Set up Kubeflow pipelines on Kubernetes to automate ML workflows, including data preprocessing, model training using TensorFlow and PyTorch and batch inference.
Skilled in designing and configuring Virtual Data Centers in AWS Cloud, including VPC, Public and Private Subnets, Security Groups, Route Tables, and Elastic Load Balancers (ELBs) to support enterprise level data warehousing.
Hands-on experience with Bash scripting and Ansible Playbooks for server automation, package management and security patching.
Expertise in maintaining secure server-client communication using OpenSSL and Certbot,WinAcme certificates for certificate management
Implemented a fully automated CI/CD pipeline using Jenkins to build Dockerized applications, perform static code analysis with SonarQube, and publish artifacts to Nexus Repository. Leveraged Kubernetes (EKS) for orchestrating containerized workloads.
Skilled in deploying Java, PHP and Node.js applications with a focus on performance optimization and security enhancements across web servers such as HTTPD, Apache, Tomcat and Nginx.
Trained and mentored L1, L2, and L3 support teams enhancing their advanced troubleshooting capabilities.

CERTIFICATIONS
RHCE : RedHat Certified Engineer
MCSA : Microsoft Certified Solution Associate
MS-700 : Microsoft Certified Teams Administrator Associate
Ribbon SBC Edge Technical : SBCE11
AWS Certified Solution Architect
AWS Certified Machine Learning Engineer-Associate




TECHNICAL SKILLS
Cloud Platforms AWS (EC2,VPC,RDS,IAM, CloudFormation, S3, ELB, Auto Scaling, CloudFront, Route 53, CloudWatch, ,AWS SageMaker, EBS),Azure, OVH (Dedicated Cloud Hosting),VMware,GCP
DevOps Tools | CI/CD Jenkins,Ansible,Terraform,GitHub,Azure Pipelines,AWS Code Pipeline,Docker,
Kubernetes,SonarQube,Nexus, Python, Bash Shell Scripting, Git,GitHub,GitLab,
Bitbucket,Azure Repos
Monitoring and Security Nagios,Prometheus,Grafana,Splunk,OSSEC,Lynis,Wireshark,Solar-Winds,Nessus,ELK,chkrootkit,Rkhunter
Databases MySQL,PostgreSQL,RDS,Cassandra,DynamoDB,Redis,Redshift
Operating Systems Linux(RHEL,Ubuntu,Suse,Debian,CentOS),Windows Server 2008R2,2012,2016
Network Protocols SSH, DNS, DHCP, HTTP/HTTPS, FTP, LDAP, Samba, NFS


PROFESSIONAL EXPEREINCE

WPP, San Francisco, CA August 2024 - Present
Role: Sr. DevOps Engineer
Description: My role was pivotal in automating continuous integration/continuous Deployment pipelines, managing hybrid cloud infrastructures, optimizing containerized microservices, and implementing advanced monitoring and security solutions. By driving infrastructure automation and enabling seamless deployments collaborating with Data Science team, I have reduced deployment times by 40%, minimized downtime to 99.99% availability, and improved operational efficiency delivering measurable cost savings of $3 million annually.

Responsibilities
Designed and managed scalable and highly available cloud infrastructures on AWS using services like EC2, VPC, RDS, IAM, S3, and CloudFormation.
Engineered fully automated CI/CD pipelines using Jenkins, SonarQube and Nexus, reducing deployment times by 40%.
Implemented centralized logging and monitoring systems using ELK Stack (Elasticsearch, Logstash, Kibana) and AWS CloudWatch, improving system observability, real-time alerting.
Integrated AWS SageMaker for training and deploying machine learning models with TensorFlow and PyTorch.
Engineered high-availability and disaster recovery solutions using Pacemaker, Corosync, and Rsync, ensuring near-zero downtime for critical services.
Collaborate with data science and engineering teams to design and implement optimized workflows for ML/AI workloads.
Automated VM provisioning and configuration in Proxmox using Ansible, ensuring consistent environments for data scientists and developers.
Demonstrates a strong willingness to adapt to new technologies and embraces self-learning to efficiently deliver high-quality solutions in evolving technical environments.
Work on log processing pipelines with tools like Kafka and Logstash to ensure reliable ingestion and search indexing
Collaborated with stakeholders to gather requirements and translate business needs into scalable cloud solutions.
Integrated automated testing and deployment workflows to ensure seamless application delivery and high-quality releases.
Experience with Kubeflow for building, orchestrating, and deploying ML workflows on Kubernetes.
Deployed and orchestrated containerized applications using Docker and Kubernetes, implementing dynamic scaling, rolling updates, and auto-healing capabilities to ensure high availability and reliability of services.
Manage and optimize virtualized environments using Proxmox, VMware vSphere, and KVM/QEMU, ensuring resource allocation aligns with organizational needs.
Enhance collaboration across teams, ensuring seamless integration of development, testing, and deployment workflows.
Environment/Tools: Git,GitHub,Jenkins,Maven,SonarQube,Nexus,Kubernetes,EKS,AWS Fargate,AWS ECS, Prometheus,Grafana,ELK Stack,Terraform,AWS SageMaker,PyTorch

COMCAST, Philadelphia (Remote) Aug 2022 June 2024
Role: DevOps Engineer
Responsibilities
Deployed and managed LXC containers on Proxmox for lightweight virtualization, streamlining resource provisioning and streamlining resource provisioning and improving the efficiency of DevOps workflows through reduced overhead and faster container lifecycle management.
Configured IAM roles, accounts and policies to ensure secure and controlled access across AWS resource.
Managed Docker images in Nexus Artifactory for efficient container lifecycle management and deployment workflows.
Set up monitoring, alarms and real-time notifications using AWS CloudWatch, Nagios, and the ELK Stack (Elasticsearch, Logstash, Kibana), enabling proactive infrastructure monitoring.
Implemented AWS solutions using EC2,S3,RDS,IAM,Elastic Load Balancers (ELB),Auto Scaling Groups and Route 53 for DNS management, ensuring scalable, secure and highly available Infrastructure.
Performed blue-green and canary deployments to enable seamless application updates while ensuring high reliability and minimal downtime.
Deployed and optimized databases like MySQL, PostgreSQL, and MongoDB Clustering in virtualized and containerized environments.
Configured automated database snapshots using AWS Snapshots and Percona XtraBackup, ensuring consistent backups and migration without affecting performance.
Sends log files to Amazon S3 buckets for long-term storage and compliance purposes
Designed CloudFormation and Terraform templates to provision AWS infrastructure-as-code including networking, compute, storage, security, and monitoring.
Developed and maintained end-to-end CI/CD pipelines using Jenkins, automating code integration, testing, and deployment processes.

Environment/Tools: Linux,AWS,Docker, Kubernetes, Jenkins, Nexus Artifactory, Nagios, ELK stack, Ansible, GitLab, Route53, IAM, CloudWatch, S3, EC2, Security Groups, TCP/IP, DNS, Shell/Bash Scripting

HTC Global, India Aug 2020 Aug 2022
Role: SRE | Devops Engineer
Responsibilities
Engineered comprehensive monitoring and observability solutions utilizing Prometheus, Grafana, and Datadog; established alerts that led to a 40% decrease in incident response times and improved system reliability metrics by 25%.
Integrated monitoring tools with incident management systems such as PagerDuty, reducing mean time to resolution (MTTR) by streamlining alert and escalation workflows.
Developed and maintained Ansible playbooks to automate server provisioning and configuration processes, ensuring consistent and efficient deployments across environments. Integrated Rundeck for task orchestration and self-service automation.
Actively participated in post-incident reviews and root cause analysis (RCA), documenting findings and implementing preventive measures to improve system reliability.
Automated incident response and ticket resolution processes using custom scripts, improving operational efficiency and reducing manual intervention.
Regularly audited and reviewed monitoring systems to align with evolving business requirements and system needs.
Orchestrated end-to-end data pipelines, integrating Kafka with splunk streaming, to facilitate seamless data flow from producers to consumers.
Set up monitoring dashboards using tools like Grafana and Kafka Manager, providing real-time insights into stream health, throughput, and latency.
Environment/Tools: Prometheus, Grafana, Datadog, PagerDuty, New Relic, AppDynamics, Dynatrace, ELK Stack, Splunk, JMeter, Gatling, Locust, Apache Bench, CloudWatch, Ansible, RCA documentation.
GBS PLUS PVT LTD, India Apr 2018 Jul 2020
Role: Linux | Devops Engineer
Responsibilities
Installed, upgraded, and managed packages on Red Hat Linux and Debian servers using YUM,RPM and APT tools.
Linux server hardening strategies by configuring Fail2ban to prevent brute-force attacks, enforcing access controls with SELinux, AppArmor and securing network traffic using Firewalld and iptables.
Installed, configured, and maintained application servers like WebSphere and WebLogic, as well as web servers including Apache, HTTPD, and Tomcat on Linux and UNIX platforms.
Managed domain mapping and SSL certificate integration, including free SSL solutions like Certbot, and handled domain management using cPanel and providers like GoDaddy.
CI/CD pipelines using Jenkins to automate build, test, and deployment processes.
Configured and installed SSH (Secure Shell) encryption for secure access on Ubuntu and Red Hat Linux systems.
Conduct vulnerability assessments using Nessus,Qualys and Lynis and mitigate risks by applying patches,upgrades.
Manage storage solutions using LVM, HDFS, Ceph, GlusterFS, and distributed storage for scalable data management.
Created and maintained VM images for VMware vSphere, RedHat OpenShift,VMware Workstation.
Perform advanced kernel tuning, resource allocation, and performance optimization for large-scale deployments.
Automated backups using Crontab based on client-specific requirements
Set up password-less authentication and agent forwarding for secure server access using ssh-keygen.
Expertise in Linux system security hardening, vulnerability assessments, and ensuring compliance with industry standards like PCI DSS and GDPR

Environment/Tools: Windows Server 2012, CentOS 8.x, Red Hat Enterprise Linux 9, YUM, RPM, Bash, Shell, HTML, Firewall, Apache, Tomcat, LDAP, NFS, Samba, SSH, DHCP, DNS, Kickstart, TCP/IP, WebSphere, WebLogic, Nagios

Techvantage Systems, India Nov 2012 Mar 2018
Role: Sr.System Administrator
Responsibilities
Installed, configured, and maintained Windows Server environments, ensuring stable and secure operations.
Performed server patching and updates using Windows Server Update Services (WSUS) and automated tools to maintain compliance and reduce vulnerabilities.
Managed Active Directory (AD), including creating and managing users, groups, and organizational units (OUs) to maintain efficient directory structures
Configured role-based access control (RBAC) to enforce least privilege access policies across user accounts, enhancing security.
Implemented and maintained Group Policy Objects (GPOs) to enforce organizational security and compliance requirements, such as password policies and application restrictions
Deployed and maintained System Center Configuration Manager (SCCM) for operating system imaging, application deployment, and patch management, streamlining endpoint management.
Designed and managed Active Directory (AD) infrastructure, including domains, forests, and trust relationships, ensuring seamless authentication and resource access across the organization.
Created and maintained standardized system images using Sysprep for consistent deployment across diverse hardware and virtual environments.
Administering Office 365 applications, including email (Exchange Online), SharePoint, OneDrive, and Teams.

Environment/Tools:Windows Server 2008 R2, Windows Server 2012, CentOS, Ubuntu Server, VMware ESXi, Task Manager, Resource Monitor, Performance Monitor, YUM, Hyper-V.
Keywords: continuous integration continuous deployment artificial intelligence machine learning javascript sthree active directory rlang information technology microsoft California

To remove this resume please click here or send an email from [email protected] to [email protected] with subject as "delete" (without inverted commas)
[email protected];4780
Enter the captcha code and we will send and email at [email protected]
with a link to edit / delete this resume
Captcha Image: