Guillermo Hoyo Bravo

GenAI Engineer — Madrid, Spain

guillehoyob@gmail.com · github.com/guillehoyob · linkedin.com/in/guillermo-bravo

Professional summary

GenAI engineer with around two years building production AI systems. I design, build, evaluate, and lead RAG architecture and multi-agent orchestration — from retrieval pipelines and ingestion libraries to evaluation suites that gate every deploy. I work AI-pair-engineering as a documented method: I direct the design and review the build commit by commit, so the human owns the decisions and the AI accelerates the work.

Experience

AI Engineer — IDEA

[PLACEHOLDER — dates to confirm]

Sole AI engineer at IDEA: designed, built, and leads the company’s production RAG platform.

Designed, built, and now lead a production RAG platform on Docker, Azure CI/CD, PostgreSQL, and Qdrant, with an endpoint factory and reusable ingestion/retrieval libraries so new assistants ship from shared components.
Integrated a RAGAS evaluation module and Langfuse observability so retrieval and answer quality are measured, not assumed.
Shipped two assistants to production — an HR assistant and an electrical-engineering-rules assistant — with strong measured results [PLACEHOLDER — metrics to confirm].
Mentor an engineer building a LangGraph agent on the platform.

RAG Chatbot Lead (AI-assisted) — Zelebrix

[PLACEHOLDER — ongoing, to confirm]

Joined to lead one AI feature — a RAG chatbot for a multi-tenant payments SaaS. Built and validated; scheduled to deploy to production in [PLACEHOLDER — target window, ~a few months]. Ongoing collaboration, role still being defined (currently unpaid).

Designed and led the build of the RAG chatbot, AI-assisted and reviewed commit by commit.
Built hybrid retrieval: dense pgvector plus Spanish Postgres full-text search, fused with Reciprocal Rank Fusion, then a cross-encoder reranker and a relevance gate.
Designed privacy-by-design: PII scrubbing before the LLM, and data queries executed locally so the external LLM never sees tenant data — a boundary that holds by construction.
Built a custom evaluation suite — a 150-question golden set, a retrieval probe, and a RAGAS-lite scorer — and an LLM benchmark that showed with data the bottleneck was retrieval (synonyms), not the generation model.

Data Scientist, GenAI — Gestamp, Madrid

03/2024 – Present

Extended a production multi-agent system on Semantic Kernel; added plugins for Text-to-SQL (Databricks), document extraction (Azure Blob Storage), translation, SharePoint lookup, and a Service Desk SOAP service in under four weeks.
Built a Text-to-SQL agent with an enriched metadata layer and a custom evaluation framework with a golden dataset — 100% accuracy on controlled tests before every deploy. Implemented RBAC impersonation so every query runs under the requesting user’s identity.
Built FastAPI endpoints with async token streaming, Pydantic-validated I/O, selective per-plugin memory (MongoDB), and end-to-end impersonation; deployed via Azure DevOps CI/CD to AKS.
Contributed to a hybrid RAG pipeline on Azure AI Search (semantic + BM25 + cross-encoder reranking) with permission-aware retrieval; reviewed intern PRs and set plugin contribution criteria.

Data Analyst — FTI Consulting, Madrid

01/2024 – 03/2024

Automated data cleaning and processing for large datasets with Python and KNIME; researched OCR digitisation (Pytesseract, OpenCV, Azure AI Studio).

Education

MSc in Data Science — Autonomous University of Madrid (UAM), 2021–2023. GPA 7.63/10. Time-series forecasting, deep learning, NLP, optimization.
BSc in Computer Engineering — UAM, 2016–2021. Erasmus at the University of Bergen, Norway. GPA 7.03/10.

Certifications

Hugging Face — “Fine-tuning Language Models” (Oct 2025)
AlgoExpert — “Machine Learning Expert” (Sep 2025)

Languages

Spanish — native
English — B2; bilingual education, Erasmus in Norway
Vietnamese — learning

Skills

GenAI & RAG: Semantic Kernel, Azure OpenAI, hybrid RAG (vector + BM25 + reranking), multi-agent orchestration, prompt engineering, RAGAS evaluation, cross-encoder rerankers, Reciprocal Rank Fusion, LangGraph, Langfuse.

Cloud & Data: Azure (AI Search, Blob Storage, Key Vault, Document Intelligence, AKS, DevOps), Databricks, Docker, CI/CD, PostgreSQL/pgvector, Qdrant.

Programming: Python (FastAPI, Pandas, PySpark, Scikit-learn), SQL, Power BI.