An exceptional opportunity for a Machine Learning Engineer (with Full-Stack experience) to join an innovative market leader at the forefront of developing next-generation solutions that transform digital interactions. The role will focus on projects to leverage state-of-the-art generative AI, retrieval-augmented generation (RAG), and reasoning frameworks to build intelligent and context-aware systems.
We are seeking talented Machine Learning Engineers with full-stack software development experience to join our client's team and help shape the future of AI-powered automation. Within this dynamic role varied duties will include:
- Search relevancy engineering.
- Conversational AI Development: Design, train, fine-tune, and deploy LLMs with reasoning capabilities.
- Retrieval-Augmented Generation (RAG): Implement, optimise, and scale RAG pipelines for effective information retrieval from structured and unstructured sources.
- Model Fine-Tuning & Training: Train domain-specific models using techniques like LoRA, QLoRA, PEFT, reinforcement learning, and supervised fine-tuning (SFT).
- Model Deployment & Inferencing: Optimise model serving and inference using vLLM, DeepSpeed, TensorRT, Triton, and other acceleration frameworks.
- Multi-Agent Systems: Develop and integrate agentic capabilities using frameworks such as LangChain, CrewAI, AutoGen, and DSPy.
- AWS Cloud & MLOps: Deploy scalable machine learning workloads on AWS using services like SageMaker, Bedrock, Lambda, S3, DynamoDB, ECS, and EKS.
- End-to-End AI Product Development: Work across the full ML lifecycle, from data collection and preprocessing to model evaluation, deployment, and monitoring.
- Full-Stack Integration: Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js.
- Vector Databases & Search: Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch.
Required skills & experience:
- 3-5+ years in machine learning and software development
- Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers
- Experience with RAG, LLM fine-tuning, and expertise in AWS and cloud-native AI deployments.
- Full-stack experience (React, TypeScript, Node.js) and API development.
- Familiarity with vector search and multi-agent orchestration
Apply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in Leicestershire.
MLE(phone number removed)AM
INDAM