Duration: 12 MonthsLocation: Philadelphia, PA
Note: Hybrid role, minimum 3 days in the office
Interview: 1st interview, 1-hour, in-person; 2nd interview, 1-hour, in-person
Job description:
Consultant Requirements – On-Prem LLM & Vector DB Implementation
Core Experience
- Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
- Strong proficiency in Python for LLM inference, prompt engineering, and integration
- Experience with CPU-based inference, model quantization, and performance tuning
- Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
- Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
- Experience generating and managing embeddings and metadata filtering
- Understanding of data privacy, air-gapped deployments, and enterprise security requirements
- Experience implementing access controls and audit logging
- Experience with LangChain or LlamaIndex
- Exposure to Rust, Go, or C++ for high-performance services
- Familiarity with Docker and Kubernetes for on-prem deployments
- Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
- Prior work in regulated or enterprise environments
- Reference architecture and deployment guidance
- Working prototype (LLM + vector DB + RAG)
- Documentation and knowledge transfer to internal teams
About Us:
Since 2000, Tri-Force Consulting Services (https://triforce-inc.com) has been an MBE/SDB certified IT Consulting firm in the Philadelphia region. Tri-Force specializes in IT staffing, software development (web and mobile apps), systems integration, data analytics, system automation, cybersecurity, and cloud technology solutions for government and commercial clients. Tri-Force works with clients to overcome obstacles such as increasing productivity, increasing efficiencies through automation, and lowering costs. Our clients benefit from our three distinguishing core values: integrity, diligence, and technological excellence. Tri-Force is a six-time winner among the fastest-growing companies in Philadelphia and a four-time winner on the Inc. 5000 list of the nation's fastest-growing companies.
Learn more about this Employer on their Career Site
