About Us:
Saguna Consulting partners with organizations to transform bold ideas into scalable, long-lasting solutions through a blend of strategy, technology, and execution. With a human-centered and outcome-driven approach, Saguna collaborates with product-led teams, founders, and enterprises to deliver practical, intuitive, and impactful solutions. From early-stage concepts to production-ready platforms, the company acts as a trusted advisor, builder, and long-term partner.
Overview:
We are seeking an AI Engineer with 5+ years of experience in Machine Learning, Large Language Models (LLMs), AI Agent Development, Automatic Speech Recognition / Speech-to-Text (ASR/STT), and Text-to-Speech (TTS) technologies. The ideal candidate will design, develop, and deploy intelligent AI solutions, conversational systems, and voice-based applications that drive business innovation and enhance user experiences.
Key Responsibilities:
- Design, develop, and deploy AI/ML models and Generative AI solutions.
- Build applications using LLMs (GPT, Llama, Claude, Mistral, etc.) and optimize their performance.
- Develop AI Agents using LangChain, LangGraph, CrewAI, AutoGen, or similar frameworks.
- Implement Retrieval-Augmented Generation (RAG) pipelines leveraging vector databases and knowledge retrieval systems.
- Apply Prompt Engineering and prompt optimization techniques to improve model outputs.
- Design and integrate ASR/STT and TTS solutions for voice-enabled applications.
- Fine-tune foundation models and optimize inference for scalability and efficiency.
- Build APIs, microservices, and cloud-based AI applications.
- Collaborate with cross-functional teams to deliver AI-driven business solutions.
- Monitor model performance and continuously improve AI systems.
- Stay updated with advancements in Generative AI, NLP, Voice AI, and Agentic AI technologies.
Required Qualifications:
- Bachelor's or Master's degree in Computer Science, AI, Machine Learning, Data Science, or a related field.
- 5+ years of experience in AI/ML development and deployment.
- Strong proficiency in Python, PyTorch, TensorFlow, Scikit-learn, and Hugging Face.
- Hands-on experience with LLMs, Prompt Engineering, Fine-tuning, RAG, Embeddings, and Vector Databases (Pinecone, Chroma, FAISS, Milvus, etc.).
- Experience building AI Agents using LangChain, LangGraph, CrewAI, AutoGen, or similar frameworks.
- Strong knowledge of NLP, ASR/STT (Whisper, Deepgram, Azure Speech, etc.), and TTS technologies (ElevenLabs, Coqui, Google TTS, etc.).
- Experience with cloud platforms (AWS, Azure, or GCP), Docker, Kubernetes, CI/CD, and MLOps practices.
- Familiarity with FastAPI, Flask, REST APIs, and microservices architecture.
- Excellent analytical, problem-solving, and communication skills.
Preferred Technical Exposure:
- LLM
- AI/ML
- NLP
- Fine-Tuning
- RAG
- AI Agents
- ASR/STT/TTS
- Python
- Machine Learning
- AWS, Azure, or GCP
- Docker, Kubernetes, CI/CD pipelines
Would you like to grow forward together?