AI Platform Architect at Remote, Remote, USA |
Email: srivalli@fluxteksol.com |
From: Srivalli, Fluxteksolutions srivalli@fluxteksol.com Reply to: srivalli@fluxteksol.com Hi Role: AI Platform Architect Location: NYC: Hybrid 2 Days Onsite Visa: H1b, GC, H4-EAD Experience: 12-15 Responsibilities: Design and develop end-to-end applications that seamlessly integrate machine learning capabilities, including real-time inference, batch processing, and efficient data management to deliver scalable and robust solutions. Identify bottlenecks in the model development, deployment, and monitoring process. Design and implement production-ready machine learning pipelines, including model training, validation, deployment, and monitoring (e.g., labeled data sets to check the performance of prompts). Build scalable, high-performance infrastructure to support Generative AI workflows (e.g., distributed training, inference optimization, and GPU/TPU utilization). Deploy GenAI applications into production cloud environments with performance, cost, and latency trade-offs considered (e.g., open-source vs. closed-source, quantization, prompt token length, completion caching, prompt caching). Monitor and troubleshoot model performance, addressing issues such as performance drift and response latency. Stay at the forefront of Generative AI advancements, identifying opportunities to incorporate the latest research and techniques into production systems. Qualifications: Bachelors or advanced degree in computer science, engineering, or a related field. 3+ years of experience in machine learning engineering, with a focus on deploying AI systems at scale. Experience working with large-scale Generative AI applications in production environments. Relevant experience in the legal domain is a plus. Strong proficiency in Python and machine learning frameworks (e.g., TensorFlow, PyTorch). Experience with Generative AI tools and techniques (e.g., LLMs, quantization, synthetic data generation, knowledge distillation, retrieval-augmented generation, fine-tuning). Proficiency in common GenAI libraries (e.g., LangChain, Autogen) and cloud-native AI services (e.g., Azure search) Knowledge of cloud infrastructure (e.g., Azure) and management tools for IT components, storage, networking, and caching. Familiarity with ML Ops principles, including CI/CD pipelines, containerization, and automated testing for AI systems. Experience with modern container platforms (e.g., Docker, OpenShift) and tools like Jenkins, Git, and Sonar. Strong problem-solving skills with the ability to address complex technical challenges. Excellent communication skills to collaborate with cross-functional teams and explain technical concepts to non-technical stakeholders. Eagerness to stay updated with cutting-edge AI research and apply innovative ideas to real-world problems. Organization and attention to detail, ensuring high-quality delivery. Ability to work collaboratively to create innovative and efficient solutions. -- Thanks & Regards Srivalli. Gutti US IT Recruiter Email: srivalli@fluxteksol.com Direct: +1 8323070476, www.linkedin.com/in/srivalli-gutti-3a68aa263 -- Thanks & Regards Srivalli. Gutti US IT Recruiter Email: srivalli@fluxteksol.com Direct: +1 8323070476, www.linkedin.com/in/srivalli-gutti-3a68aa263 https://chat.whatsapp.com/C83xhWRw3Sx9S7m9k7S8qa Keywords: continuous integration continuous deployment artificial intelligence machine learning information technology green card AI Platform Architect srivalli@fluxteksol.com |
srivalli@fluxteksol.com View All |
08:56 AM 10-Jan-25 |