Site Reliability Engineer with Machine Learning | Remote, USA
Email: [email protected]
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=656049&uid=

From:

Akash Mandal,

trooBell Technologies LLC.

[email protected]

Reply to: [email protected]

Hello,

Hope you are doing well.

We have an immediate opening for the role below. Please share the profiles that best match it.

Role: Site Reliability Engineer with Machine Learning

Location: Remote

Duration: 6+ Months

Interview: Video/Phone

Total Experience: 8+ Years

Must have an active LinkedIn profile

Key to this position: Must have Machine Learning (ML) experience. Needs strong Kubernetes and Docker skills. Must have experience with Lucene/Solr. Strong skills writing production-level code. Strong communication skills.

Pro-tip:
Etsy loves people from these 4 companies (in order): Pinterest, Google, Meta, and Shopify. People from these companies get the most offers.

Job Description:

The ML SRE team supports the infrastructure and provides a developer-focused, scalable, and reliable infrastructure to develop and deploy ML services seamlessly.

Qualifications:

About the Role

Design, build and support the core infrastructure used by all ML services, including on-call production support rotations.

Work cross-functionally with various platform teams, ML teams and product partners to build the next generation of our high availability ML services in the cloud.

Build and maintain observability and test tooling - logging, monitoring, distributed tracing, alerting and offline test tools needed.

Practice continuous learning and an agile delivery model to stay informed and focused on our deliverables.

Support GKE services and maintenance, including software upgrades, performance tuning, and GKE cluster tuning and optimization.

Build GKE tooling and automate deployments.

About You:

You have solid engineering and coding skills, knowledge of data structures, and the ability to write high-performance, production-quality code.

You have experience working with languages like Java, Scala, Python, Go or other equivalent languages.

You are a strong collaborator and communicator and you make the engineers around you grow and learn.

You have foundational experience with infrastructure engineering and strong troubleshooting skills.

You have a solid background and hands-on experience with cloud technologies, either Google Cloud or AWS.

Experience with search technologies such as Lucene/Solr or Elasticsearch is a plus.

Experience with supporting ML Services is a plus.

Experience with Kubernetes and Docker is a plus.

Experience with Unix/Linux operating systems and the networking stack (e.g., TCP/IP, routing, network topologies and hardware, SDN) is a plus.

Experience with Grafana is a plus.

Thanks and regards,

Akash Mandal | Technical Recruiter


trooBell Technologies LLC.

Address: 9420 River Lake Drive, Roswell, GA 30075

Keywords: machine learning golang Georgia
09:35 PM 19-Sep-23

