Platform Engineer - Reliability & Scale
LangChain (27 days ago)
About this role
As a Platform Engineer - Reliability & Scale at LangChain, you will design and implement high-throughput, data-intensive systems for LangSmith and LangGraph, ensuring their scalability and reliability. Your responsibilities include building monitoring and automated recovery systems, debugging performance issues, and shaping platform strategy through technical decision-making. You will also participate in on-call rotations to address incidents and drive improvements with a focus on operability and automation. A strong background in cloud infrastructure, database management, and software engineering is essential for success in this role.
Required Skills
- System Architecture
- Reliability Engineering
- Incident Response
- Performance Optimization
- Database Management
- Cloud Infrastructure
- Containerization
- Observability Tools
- Software Engineering
- Operational Practices
About LangChain
www.langchain.com
LangChain is a leading platform for building and deploying reliable AI agents, providing both engineering tools and open-source frameworks for developers. It streamlines the agent development lifecycle by offering products such as LangChain, LangGraph, and LangSmith, which cater to different levels of control and complexity in agent engineering. LangChain's innovative platform is designed to support various AI models, facilitating quick iteration and integration while ensuring high performance. With robust observability and evaluation tools, LangChain empowers organizations to enhance their AI capabilities and improve operational efficiencies across diverse use cases.