Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
3.0 - 7.0 years
0 Lacs
chennai, tamil nadu
On-site
As a hands-on backend expert, you will be responsible for taking our FastAPI-based platform to the next level by building production-grade model-inference services, agentic AI workflows, and seamless integration with third-party LLMs and NLP tooling. Please note that this role is being hired for one of our client companies, and the company name will be disclosed during the interview process. In this role, you will have the opportunity to work on the following key areas: Core Backend Enhancements: - Build APIs - Harden security with OAuth2/JWT, rate-limiting, SecretManager, and observability with structured logging and tracing - Implement CI/CD, test automation, health checks, and SLO dashboards Awesome UI Interfaces: - Develop UI interfaces using React.js/Next.js, Redact/Context, and various CSS frameworks like Tailwind, MUI, Custom-CSS, and Shadcn LLM & Agentic Services: - Design micro/mini-services to host and route to OpenAI, Anthropic, local HF models, embeddings, and RAG pipelines - Implement autonomous/recursive agents that orchestrate multi-step chains including Tools, Memory, and Planning Model-Inference Infrastructure: - Set up GPU/CPU inference servers behind an API gateway - Optimize throughput with batching, streaming, quantization, and caching using technologies like Redis and pgvector NLP & Data Services: - Own the NLP stack focusing on Transformers for classification, extraction, and embedding generation - Build data pipelines that combine aggregated business metrics with model telemetry for analytics You will be working with the following tech stack: - Python, FastAPI, Starlette, Pydantic - Async SQLAlchemy, Postgres, Alembic, pgvector - Docker, Kubernetes, or ECS/Fargate on AWS or GCP - Redis, RabbitMQ, Celery for jobs and caching - Prometheus, Grafana, OpenTelemetry - HuggingFace Transformers, LangChain, Torch, TensorRT - OpenAI, Anthropic, Azure OpenAI, Cohere APIs - Pytest, GitHub Actions - Terraform or CDK To be successful in this role, you must have: - 3+ years of experience building production Python REST APIs using FastAPI, Flask, or Django-REST - Strong SQL schema design and query optimization skills in Postgres - Deep knowledge of async patterns and concurrency - Hands-on experience with UI applications that integrate with backend APIs - Experience with RAG, LLM/embedding workflows, prompt-engineering, and agent-ops frameworks - Cloud container orchestration experience - Proficiency in CI/CD pipelines and infrastructure-as-code Nice-to-have experience includes familiarity with streaming protocols, NGINX Ingress, RBAC, multi-tenant SaaS security, data privacy, event-sourced data models, and more. This role is crucial as our products are live and evolving rapidly. You will have the opportunity to own systems end-to-end, scale AI services, work closely with the founder, and shape the future of our platform. If you are seeking meaningful ownership and enjoy working on challenging, forward-looking problems, this role is perfect for you.,
Posted 1 week ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
31458 Jobs | Dublin
Wipro
16542 Jobs | Bengaluru
EY
10788 Jobs | London
Accenture in India
10711 Jobs | Dublin 2
Amazon
8660 Jobs | Seattle,WA
Uplers
8559 Jobs | Ahmedabad
IBM
7988 Jobs | Armonk
Oracle
7535 Jobs | Redwood City
Muthoot FinCorp (MFL)
6170 Jobs | New Delhi
Capgemini
6091 Jobs | Paris,France