About our client & the role
Our client, expert technologists who share a passion for building technology solutions responsibly, focuses on business outcomes. Their mission is to weave excellence, innovation, and sustainability into the very fabric of the digital landscape, and we specialise in Cloud, Data, AI, and emerging technologies.
They maintain solid collaborations with top-tier Independent Software Vendors (ISVs) in the US, providing them with a strong foundation to address complex challenges and deliver business outcomes.
Requirements
- Bachelor’s degree in Computer Science, Software Engineering, or related field.
- 3+ years of professional experience building enterprise-grade applications with Python.
- 1+ years of experience developing and deploying Generative AI applications using Large Language Models (LLMs) for text, image, or code generation.
- Strong knowledge of modern GenAI frameworks (e.g., LangChain, LlamaIndex) and hands-on expertise in prompt engineering, including Retrieval-Augmented Generation (RAG) with vector databases such as Pinecone, Redis, or cloud-native vector stores.
- Proven experience in serving and scaling LLMs on cloud platforms (Azure AI/ML, AWS SageMaker, GCP Vertex AI), with an emphasis on performance, latency, and cost optimization.
- Familiarity with fine-tuning pre-trained models and securely integrating APIs (e.g., OpenAI, Azure OpenAI Service) in enterprise environments.
Responsibilities
- Design and build scalable, secure Python backend applications across the SDLC.
- Collaborate with teams to deliver high-quality solutions and influence architecture.
- Develop reliable, data-driven systems using databases, APIs, and cloud services.
- Ensure robust, testable, well-documented code integrated with modern platforms.
- Mentor junior developers on best practices, optimization, and CI/CD.
- Apply latest Python frameworks and tools to solve complex engineering challenges.