About The Role
We’re hiring a Senior AI Engineer with expertise in Computer Vision, document understanding, and voice AI to help build the brains behind our AI agents.
You’ll work on the two core components of our AI agents – first, the core perception systems that extract structured insights from messy, real-world freight documents—handwritten, scanned, distorted, or multi-page – and second, our AI agents for email and voice communications between freight entities.You will do a lot of prompt engineering, fine-tuning LLMs, building large-scale document classification and entity extraction models, communication understanding, intent classification, and voice AI – your code will be at the heart of automating financial decision-making in freight.You’ll collaborate closely with the backend and product teams to bring AI models to life in production environments and continuously improve performance in the wild.What You’ll Do
👉🏼 Build and fine-tune AI models for document classification, OCR, entity recognition, and layout parsing
👉🏼 Build AI agents for email and phone communications between different freight accounting parties – payer and payee👉🏼 Develop scalable pipelines for pre-processing, training, inference, and feedback loops👉🏼 Evaluate and integrate VLMs👉🏼 Annotate, clean, and curate diverse freight documents for robust model performance👉🏼Build training, evaluation, and test datasets👉🏼Identify issues identified in production data and fix them asap👉🏼Iterate on improving existing and new AI stack👉🏼 Productionize AI models as part of Lighthouz’s intelligent automation stack👉🏼 Collaborate with backend engineers to integrate model outputs into document, email, and voice workflows👉🏼 Continuously monitor and improve model performance in real-world conditionsWhat We’re Looking For
👉🏼 3–6 years experience in ML or AI roles, preferably focused on computer vision or document AI
👉🏼 Strong foundation in deep learning frameworks (e.g., PyTorch, TensorFlow)👉🏼 Experience in fine-tuning VLMs and LLMs👉🏼 Experience in voice AI👉🏼 Experience with document/image OCR, visual transformers, and multimodal models👉🏼 Proficiency in Python and common ML tooling (e.g., Hugging Face, OpenCV, spaCy)👉🏼 Hands-on experience training and deploying models in production👉🏼 Strong problem-solving skills and a builder mindset—you move fast and iterate faster👉🏼 Comfortable working with ambiguity and evolving datasets👉🏼 Willingness to work long hoursNice to Have
👉🏼 Familiarity with freight, logistics, or fintech workflows
👉🏼 Experience with AWS, Azure, or GCP-based ML infrastructure👉🏼 Exposure to RAG pipelines, foundation models, or vector search systems👉🏼 Knowledge of document layout understanding (e.g., Donut, LayoutLM, PubLayNet)👉🏼 Background in building secure, production-grade ML servicesWhat We Offer
💰 Competitive salary
🌎 Fully remote🛠️ High ownership, zero bureaucracy—help shape our AI stack from day one🚀 Work on impactful real-world problems that blend AI and automation at scaleSkills: communication understanding,node.js,rest apis,fine-tuning llms,voice ai,large-scale document classification,hugging face,spacy,nosql,kubernetes,document understanding,docker,postgresql,aws,ml tooling,production model deployment,sql,opencv,api,entity extraction,frontend javascript tech,intent classification,microservices,backend development,prompt engineering,deep learning frameworks,flask,ai/ml workflows,python,computer vision,ocr,event-driven architectures,mongodb