Learn about the whole lifecycle of LLMs and generative AI applications, from data preparation and model training to local deployment, fine-tuning, RAG systems, agents, and responsible governance.
Building and Deploying LLMs (Large Language Models)

Training details
Location
Aula Master: Edifici C6 Room E106, Campus Nord UPC C. Jordi Girona, 1-3 08034 Barcelona, Spain
Start Date
06/07/2026
Time
10 : 00
End Date
10/07/2026
Target Audiance
Scientist
Teaching language(s)
English
Organizing institution
Barcelona Supercomputing Center
Delivery mode
On-site
Level
Introductory
Format
Hands-on session, Lecture
Topics / Keywords
LLMs, Deep Learning, Machine Learning, AI, NLP
What You Will Learn
By the end of the course, participants will be able to:
– understand how LLMs are built, adapted, and executed;
– analyze datasets from a technical, legal, and ethical perspective;
– distinguish between prompting, RAG, fine-tuning, agents, and local models;
– run open models locally using current tools;
– design applications with information retrieval, tools, and workflows;
– evaluate quality, safety, risks, and feasibility of a generative solution;
– apply basic criteria for governance, regulatory compliance, and responsible use.
Agenda
10:00h-13h each day
Day 1: LLM fundamentals, pretraining, datasets, and licensing
- Introduction to LLMs, generative AI, and foundation models.
- Core concepts: tokens, Transformers, next-token prediction, scale, and compute.
- LLM lifecycle: pretraining, post-training, fine-tuning, evaluation, and deployment.
- Training data: public datasets, proprietary data, and synthetic data.
- Quality, cleaning, deduplication, bias, and dataset documentation.
- Licenses, copyright, personal data, and usage restrictions.
- Brief introduction to modern architectures such as Mixture of Experts.
Day 2: Post-training, fine-tuning, alignment, and evaluation
- Differences between pretraining, continued pretraining, instruction tuning, and fine-tuning.
- Supervised Fine-Tuning, LoRA, QLoRA, and PEFT.
- Alignment and preference-based RL methods: DPO, RLHF, GRPO.
- Dense models vs. Mixture of Experts: architecture, efficiency, and cost.
- Criteria for choosing between prompting, RAG, fine-tuning, or changing the model.
- LLM evaluation: benchmarks, golden datasets, rubrics, human and automated evaluation.
- Evaluation of hallucinations, safety, and model behavior.
Day 3: Local execution of LLMs: Ollama, llama.cpp, GGUF, quantization, and serving
- Motivations for running models locally: privacy, prototyping, cost, and control.
- Hardware requirements: CPU, GPU, memory, context size, and latency.
- Quantization and model formats, especially GGUF.
- Using tools such as Ollama and llama.cpp.
- Open models and local experimentation.
- Model serving and local APIs.
- Introduction to vLLM and comparison between local, cloud, external APIs, and HPC infrastructure.
Day 4: RAG, context engineering, tools, agents, and harnesses (Prof. Fabrício Carraro)
- Limitations of standalone LLMs.
- Prompt engineering vs. context engineering.
- RAG: document ingestion, chunking, embeddings, retrieval, reranking, and generation.
- When to use RAG versus fine-tuning.
- Tool calling and integration with external systems.
- Workflows vs. agents.
- Multi-agent flows.
- Current frameworks: LangChain, LangGraph, LlamaIndex, CrewAI, AutoGen.
- Schemas and Pydantic.
- Introduction to MCP as a standard for connecting models with tools and context.
- Risks: prompt injection, data leakage, tool misuse, and lack of traceability.
- Harnesses (Claude Code, Codex, Cursor, OpenCode)
Day 5 – Observability & ethical AI
- Principles of trustworthy AI and responsible use.
- Technical risks: hallucinations, bias, lack of robustness, and vendor dependency.
- Observability with Langfuse.
- Monitoring, red teaming, auditing, and criteria for moving from prototype to production.
- Security risks: prompt injection, data leakage, misuse of tools.
- GDPR, copyright, dataset licenses, and model licenses.
Instructor name(s)
- Fabrício Carraro
- Sven Dietz
- Martí Llopart
- Jens Grivolla
- Atia Cortés
- Luis Vasquez
Course Description
Learn about the whole lifecycle of LLMs and generative AI applications, from data preparation and model training to local deployment, fine-tuning, RAG systems, agents, and responsible governance. The training combines theoretical foundations, technical demonstrations, and independent exercises. The goal is for participants to develop the technical judgment needed to decide when to use prompting, RAG, fine-tuning, local models, external APIs, agents, or specialized infrastructure.
Prerequisites
Ideally, bring your own laptop

