Posted:1 month ago|
Platform:
Remote
Full Time
Develop and manage scalable deployment strategies specifically tailored for LLMs (GPT, Llama, Claude, etc.).
Optimize LLM inference performance, including model parallelization, quantization, pruning, and fine-tuning pipelines.
Integrate prompt management, version control, and retrieval-augmented generation (RAG) pipelines.
Manage vector databases, embedding stores, and document stores used in conjunction with LLMs.
Monitor hallucination rates, token usage, and overall cost optimization for LLM APIs or on-prem deployments.
Continuously monitor models for its performance and ensure alert system in place.
Ensure compliance with ethical AI practices, privacy regulations, and responsible AI guidelines in LLM workflows.
Gainwell Technologies
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Practice Python coding challenges to boost your skills
Start Practicing Python NowBengaluru
25.0 - 35.0 Lacs P.A.
jaipur
2.5 - 5.0 Lacs P.A.
ahmedabad
Experience: Not specified
0.48 - 0.6 Lacs P.A.
pune, bengaluru, delhi / ncr
30.0 - 45.0 Lacs P.A.
ahmedabad
Experience: Not specified
Salary: Not disclosed
pune, bengaluru
45.0 - 70.0 Lacs P.A.
gurugram, mumbai (all areas)
15.0 - 22.5 Lacs P.A.
12.0 - 20.0 Lacs P.A.
bengaluru
12.0 - 17.0 Lacs P.A.
20.0 - 25.0 Lacs P.A.