What you will do
Develop and optimize RAG pipelines to enhance LLM-driven troubleshooting.
Work with PyVector to store and retrieve relevant data efficiently.
Implement vector embedding models to power retrieval and improve accuracy.
Build high-quality Python code to power our AI applications.
Deploy and scale models on AWS (S3, Lambda, ECS, SageMaker, Bedrock).
Fine-tune and evaluate LLMs for task-specific reasoning and accuracy.
Implement efficient document indexing and retrieval for knowledge-intensive queries.
Experiment, iterate, and deploy features quickly based on real customer feedback.
Work with a small but highly skilled team to push the limits of industrial AI.
Take ownership of challenges, figure things out as you go, and adapt to changing priorities.