AI Engineering Consultant/Architect
About Cognita Reply:
At Cognita Reply, we help organisations unlock the power of AI — turning pilots into scale, and curiosity into confident adoption. We work with organisations that want to move beyond experimentation and help their people use tools like ChatGPT in practical, responsible, everyday ways.
We’re a small, fast-growing consultancy within the Reply Group, which means you’ll get exposure to real client work early on, while being supported by experienced colleagues as you build your confidence and skills.
Role Overview:
We’re looking for a hands-on AI Engineering Consultant with expertise in applied AI, agentic systems, and software engineering.
You’ll design and deliver end-to-end AI solutions, working closely with clients and internal teams to take ideas from concept to production. This includes agentic systems, bespoke applications, and early-stage (0–1) product development, primarily using OpenAI technologies.
We’re particularly interested in engineers who combine technical depth with real-world impact, and who are keen to grow into broader architectural and leadership responsibilities.
While OpenAI is central to our work, we also value curiosity and awareness of the wider LLM ecosystem to help make thoughtful, well-rounded technical decisions.
Responsibilities:
- Design and build AI applications using OpenAI and related technologies
- Develop production-ready systems across APIs, backend services, and user-facing applications
- Deliver GenAI solutions including RAG pipelines, tool use, and evaluation frameworks
- Contribute across the full solution lifecycle: design, build, testing, deployment, and iteration
- Support development of agentic systems (multi-agent workflows, orchestration, human-in-the-loop)
- Optimise performance, cost, and maintainability in cloud environments (AWS/Azure)
- Work closely with clients to shape practical, business-aligned solutions
- Communicate technical decisions clearly to technical and non-technical stakeholders
About the candidate:
- Strong hands-on software engineering experience, particularly Python and backend/API development
- Proven experience building production-ready systems and practical delivery of LLM / GenAI applications (e.g. RAG, tool use, structured outputs, evaluation)
- Ability to work across the full delivery lifecycle
- Cloud experience in AWS or Azure
- Strong engineering practices (testing, CI/CD, version control, secure development)
- Comfortable in client-facing environments, translating business needs into solutions
- Experience with OpenAI or similar LLM platforms and exposure to agentic architectures (multi-agent systems, orchestration, HITL workflows)
- Experience with vector databases and familiarity with PyTorch or TensorFlow
- 0–1 product development experience
Reply is an Equal Opportunities Employer and committed to embracing diversity in the workplace. We provide equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type regardless of age, sexual orientation, gender, identity, pregnancy, religion, nationality, ethnic origin, disability, medical history, skin colour, marital status or parental status or any other characteristic protected by the Law.
Reply is committed to making sure that our selection methods are fair to everyone. To help you during the recruitment process, please let us know of any Reasonable Adjustments you may need.