Posted: 1 day ago
Platform:
Remote
Full Time
Develop and manage scalable deployment strategies specifically tailored for LLMs (GPT, Llama, Claude, etc.).
Optimize LLM inference performance, including model parallelization, quantization, pruning, and fine-tuning pipelines.
Integrate prompt management, version control, and retrieval-augmented generation (RAG) pipelines.
Manage vector databases, embedding stores, and document stores used in conjunction with LLMs.
Monitor hallucination rates, token usage, and overall cost optimization for LLM APIs or on-prem deployments.
Continuously monitor model performance and ensure an alerting system is in place.
Ensure compliance with ethical AI practices, privacy regulations, and responsible AI guidelines in LLM workflows.
Gainwell Technologies
Information Technology and Services
approximately 5,000 Employees