Projects

Work I'm proud of

Systems and products built to solve real problems at scale. Each one taught me something new about the craft.

Open Source

GitHub

A selection of open-source experiments and projects exploring AI, machine learning, and systems engineering. Full collection at chemacabeza/my-github-projects.

🎩

Personal AI Assistant

Alfred · Bilingual (EN/ES) · Multi-account Gmail, Drive & Photos · Native WA Bridge · Atomic Session Reset · Android Auto

"At your service, sir!" A production-ready, full-stack AI butler that securely connects to multiple Google Accounts via OAuth 2.0 to manage your Gmail, Drive, Photos, Calendar, and Maps concurrently — and now connects natively to your WhatsApp via a custom Dockerized bridge (Baileys), with no Meta API tokens required. Simply scan a QR code in the high-fidelity dark-mode dashboard (mirroring WhatsApp Web) to link your personal account. Recently updated with a "nuclear" atomic session reset to purge stale connections instantly. The AI assistant is fully bilingual (English and Spanish), capable of aggregating and searching across multiple Gmail inboxes and Drive accounts, sending WhatsApp messages on your behalf, deleting calendar events, and automatically generating Android Auto navigation links for any address in your schedule. Powered by OpenAI for intelligent intent parsing, it features custom email reply templates, geocoding via Google Maps, and AES-encrypted token storage at rest. Includes a streamlined dashboard with 1-click links to enable and configure the required Google APIs effortlessly. Get started in 5 commands: git clone → ./build.sh → scan QR → Alfred is live at localhost:5173. Fully containerized: Spring Boot 3.4 + React 18 + Node.js Bridge + PostgreSQL 16.

Spring Boot 3.4React 18BilingualMulti-account GmailNative WA BridgeSocket.ioOpenAIQR AuthenticationAndroid AutoGoogle OAuth2Gmail APIDrive APIPhotos APICalendar APIMaps APIPostgreSQL 16Docker
🚀

LLM Engineering Mastery

38 chapters · Feynman Technique · RAG & Agentic AI

A deeply detailed, technically rigorous 38-chapter curriculum built using the Richard Feynman Technique. Synthesized from a 10-book technical library, this mastery sequence breaks down complex concepts in LLM Engineering—from transformer math and fine-tuning (LoRA/DPO) to advanced RAG pipelines, multimodal vision fusion, and local-first AI orchestration. Now featuring advanced modules on Multi-Agent Systems, Human-AI Co-Intelligence, and the path to robust, common-sense AI. Including core safety modules on LLM Security and Threat Modeling. Featuring layered narrative analogies and 100+ custom visual infographics, this curriculum takes you from the foundational next-token prediction logic into the deep deployment of production-ready, autonomous agentic systems.

LLMsRAGAgentic AIOllamaHuggingFacePythonDocker
🧠

ML Inference Pipeline

Train in Python → Export to ONNX → Serve in Java

A complete, production-ready machine learning pipeline that trains a neural network in Python (PyTorch), exports it to ONNX, and serves it via a Spring Boot API with 'Golden Parity' tests. This automated architecture ensures mathematically identical results between Python and Java runtimes, proving that models can be trained in Python but executed with zero-Python production footprints.

PyTorchONNXSpring Boot 3ReactGolden ParityDocker

AI Video Generation Studio

Hybrid Architecture · Online & Offline Video Synthesis

A full-stack video platform supporting 5 state-of-the-art models (Wan 2.6, Kling 2.5, LTX-2, PixVerse) via fal.ai. Engineered with a flexible hybrid architecture: a single configuration switch lets you toggle between running models entirely offline on local hardware or connecting to cloud APIs for scale. Features async job orchestration, isolated Docker environments, and zero-downtime database migrations with automated health checks.

Spring Boot 3.4React 18Hybrid CloudLocal Inferencefal.aiWan 2.6Kling 2.5Docker
🔊

AI Voice Studio

Text-to-speech Generator · OpenAI TTS

Convert text to natural-sounding speech instantly using the OpenAI TTS API. Choose from 11 expressive voices and stream raw MP3 audio straight to your browser. The secure architecture keeps API keys strictly server-side, while the Spring Boot backend and React frontend are fully Dockerized for effortless, horizontal scaling in any environment.

Spring Boot 3React 18OpenAI TTSMP3 StreamingDocker
🎧

AI Audio Listener

Hybrid Architecture · Online & Offline Transcription

A high-performance transcription pipeline powered by OpenAI Whisper. This hybrid platform allows you to toggle instantly between local-only transcription for air-gapped privacy or cloud acceleration via OpenAI APIs. Featuring a Next.js 15 App Router frontend and a Spring Boot 3.3 backend with PostgreSQL, it includes searchable history, file uploads, and browser-based recording updates.

Spring Boot 3.3Next.js 15PostgreSQLWhisperDockerFlyway
🤖

AI Voice Assistant

Conversational AI · Whisper + GPT-4o + TTS

"Hey Nova, tell me a joke." Speak naturally, and this full-stack AI assistant listens (Whisper), thinks (GPT-4o), and speaks back (TTS) — all in real time, right in your browser. Features wake word detection ('Nova'), 9 languages, 6 voice personas, custom AI naming, push-to-talk and continuous listening modes, and both online and offline backends. It's your own Alexa — but you built it, you control it, and it runs with a single Docker command.

Spring Boot 3.4React 18ViteOpenAIGPT-4oWhisperTTSDocker

Bash In Depth

A Developer's Guide · 7-part living book

A structured deep-dive into Bash systems engineering — from variables and control flow to process management and signals. This 7-part living book is written for engineers who want to go beyond quick scripts and truly master the shell that holds systems together. Features chapters on subshells, Coprocesses, and Programmable Completion.

BashSystems EngineeringLinuxDevOpsAutomation

Java + Spring Boot In Depth

24-part deep dive · Java (13) + Spring Boot (11) + Advanced Topics

A powerhouse mastery curriculum split into two comprehensive tracks. The 13-part Java track covers Fundamentals, OOP Essentials, Core APIs, Exception Handling, Generics & Type Safety, Lambda Expressions, Streams API, Concurrency & Multithreading, I/O & NIO, Advanced OOP & Design Patterns, Modern Java Features, JNI & Project Panama, and JVM Troubleshooting & Tuning. The 11-part Spring Boot track covers Spring Core & JPA CRUD, REST CRUD APIs, REST API Security, Spring MVC, JPA Advanced Mappings, AOP, Hibernate Deep Dive, and AspectJ Deep Dive. Also includes advanced standalone topics: Apache Kafka Integration, Spring Modulith, Model Context Protocol (MCP) for exposing Java tools natively to LLMs, and High-Performance gRPC Microservices.

Java 21Spring Boot 3JVM TuningHibernateAspectJKafkagRPCModulithMCPDocker
🔬

Advanced AI Curriculum

20 chapters · 7 phases · Theoretical & Distributed AI

A comprehensive, senior-level systems engineering guide synthesized from 10+ foundational textbooks (Goodfellow, Géron, Rashid, Nielsen, d2l.ai) diving into the deepest technical layers of modern AI. Spans 7 phases: hardware compilation & acceleration, Federated Learning, Graph Neural Networks, NLP pipelines, Autonomous Vehicle Perception, Biometric Security, Symbolic AI, AI Agents (RAG & Multiagent Systems), and Deep Learning Foundations covering Transformers, RNN/LSTM, Transfer Learning, TorchScript deployment, and Mixed Precision training across TensorFlow and PyTorch. Includes Dockerized exercises in both Python and Java.

Deep Learning SystemsTensorFlowPyTorchTransformersFederated LearningGNNAI AgentsRL
🧬

AI & Machine Learning Course

27 chapters · Hands-on · Fully Dockerized exercises

A hands-on, beginner-friendly curriculum synthesized from over 50 academic textbooks spanning 27 chapters of modern AI. Every chapter includes a fully Dockerized exercise — no local setup beyond Docker. Covers neural networks from scratch, Federated Learning, Reinforcement Learning, NLP, SVMs, Swarm AI, Computer Vision (CNNs), Transfer Learning, Semi-Supervised Learning, Autonomous Vehicles, Algorithmic Trading, Graph Neural Networks, DQN for games, Active Learning, Adversarial ML defenses, Fake News Detection, IoT Edge AI, Lifelong Learning, Metric Learning, ML Data Pipelines, Game Theory, Bayesian Learning, Ensemble Methods, Dimensionality Reduction, Statistical Learning Theory, Kernel Methods, and Time Series Forecasting.

Deep LearningReinforcement LearningComputer VisionNLPAdversarial MLBayesian LearningEnsemble MethodsTime SeriesDocker
🛡️

CyberSecurity Curriculum

20 chapters · Red & Blue Team · 24 textbooks · Dockerized exercises

A comprehensive, hands-on curriculum synthesized from 24 leading industry textbooks taking you from fundamental defensive principles to advanced offensive red team operations. Every chapter follows a rigorous pedagogical structure: a clear learning goal, distilled core concepts, interactive reflection questions with hidden answers, and a fully Dockerized reproducible exercise (Python or Bash) runnable in isolation. The 20 chapters cover: Security Foundations & CIA Triad, Threat Landscape, Networking for Security, Linux Hardening, Windows Hardening, Cryptography & PKI, Network Traffic Analysis, Penetration Testing Methodology, Exploitation Frameworks & Payloads, Web Application Security, Red Team Operations & Kill Chain, Malware Analysis, Reverse Engineering with Ghidra, Digital Forensics & Evidence Collection, Incident Response & SOC Operations, Cloud Security (AWS & Azure), Container & Kubernetes Security, Mobile & IoT Forensics, Threat Hunting & Intelligence, and Security Automation with Python.

CyberSecurityRed TeamBlue TeamPenetration TestingGhidraForensicsCloud SecurityKubernetesPythonDocker
🧮

Coding Algorithm

16 chapters · Visual Guides · Java, Python & C++

A comprehensive collection of 16 algorithm chapters with visual explanations, step-by-step diagrams, and complete implementations in Java, Python, and C++. Built from insights across classic computer science texts including Introduction to Algorithms (CLRS), Grokking Algorithms, The Algorithm Design Manual, and Cracking the Coding Interview. Covers Binary Search, Selection Sort, Merge Sort, Quicksort, BFS, Dijkstra's Algorithm, Dynamic Programming, Hash Tables, DFS, Hungarian Algorithm, Greedy Algorithms, Heapsort, Topological Sort, K-Nearest Neighbors (KNN), A* Search, and Bloom Filters — each with realistic hero images, execution diagrams, mathematical foundations, and tri-language code.

AlgorithmsData StructuresJavaPythonC++Graph AlgorithmsDynamic ProgrammingSortingPathfindingMachine LearningProbabilistic Data StructuresMathematics
⚙️

C++ In Depth

Advanced Systems Programming · Mastery Sequence

A 12-part sequence in high-performance C++ systems engineering. Covers the C++ Object Model under the hood, memory architecture, virtual tables, and template metaprogramming (SFINAE). Explores policy-based design and assembly-level reversing, culminating in a Mastery Project focusing on type-erased thread-safe containers.

C++Memory ArchitectureMetaprogrammingSTLAssembly
Go

Golang Mastery Curriculum

12 chapters · 4 phases · 100% Dockerized · 17 textbooks

A complete path from absolute beginner to production-ready multi-container microservices, synthesized from 17 professional-grade textbooks on gRPC, Concurrency, TDD, and DDD. Every module is 100% Dockerized with FROM scratch multi-stage builds — no local Go installation needed. Phase 1 (Foundations) covers Basics & Environment, Structs & Interfaces, and Go's explicit Error Handling with defer/panic/recover. Phase 2 (Advanced Mechanics) tackles the CSP Concurrency model (Goroutines, Channels, select), Functional Programming with Go 1.18+ Generics, and Test-Driven Development with Dependency Injection. Phase 3 (Systems & Data) covers Data Structures & Algorithms (Slice headers, Generic Queues), System Programming (POSIX, graceful SIGTERM shutdowns, bufio disk streaming), and building zero-dependency CLI binaries with Cobra. Phase 4 (Enterprise Microservices Capstone) implements Domain-Driven Design with Hexagonal Architecture (Ports & Adapters), replaces REST with gRPC over Protocol Buffers on HTTP/2, and culminates in a full multi-container microservice system linking an HTTP Gateway to a backend gRPC Core Service via Docker Compose.

GogRPCProtobufsCSP ConcurrencyTDDDDDHexagonal ArchitectureGenericsCLIDocker
🏗️

System Design Mastery

58 chapters · 13 phases · Scale to Millions

How do you design YouTube? How does WhatsApp deliver a message in 30 ms? This 58-chapter curriculum, distilled from over 48 professional-grade textbooks (DDIA, Alex Xu's Volumes, API Security in Action, Learning Serverless, Serverless Architectures on AWS, Model Context Protocol), takes you systematically through 13 phases: Phase 1 covers Scalability, Databases, Caching & Networking. Phase 2 dives into Distributed Systems theory (CAP, Raft, Saga). Phase 3 covers Building Blocks (Kafka, Gateways, Microservices). Phase 4 covers Architecture (CQRS, Event Sourcing, Observability, CI/CD). Phase 5 tackles end-to-end designs (URL Shortener, Chat, News Feed, Video). Phase 6 covers Advanced Patterns (Strangler Fig, Micro-Frontends). Phase 7 covers Trade-Offs & the C4 Model. Phase 8 delivers advanced designs (Proximity/Quadtrees, TSDBs, Kafka Deep Dive). Phase 9 focuses on robust API Architecture (REST, OpenAPI, GraphQL, OAuth/JWT). Phase 10 introduces Serverless Architecture & Cloud-Native Patterns. Phase 11 tackles Data-Intensive Systems & Flow Architectures. Phase 12 covers API Ecosystems & Professional Practice. Phase 13 covers Model Context Protocol (MCP) & AI Agent Architectures. Concludes with the ultimate System Design Interview Framework.

DDIADistributed SystemsServerlessAWSKafkaREST APIGraphQLCQRSC4 ModelMicroservicesMCPAI Agents
🏛️

Domain-Driven Design (DDD)

7 chapters · Strategic & Tactical Patterns · Java

A curriculum synthesizing the foundational literature of Domain-Driven Design (Eric Evans, Vladik Khononov) to tackle complexity in the heart of software. It teaches how to bridge the gap between domain experts and developers using a Ubiquitous Language. Covers Strategic Design (Bounded Contexts, Context Mapping), Tactical Design (Entities, Value Objects, Aggregate Roots), Domain Events, CQRS with Event Sourcing, and a comprehensive visual encyclopedia of all DDD Building Blocks. All code examples are implemented in modern Java (17+).

DDDJavaArchitectureCQRSEvent SourcingStrategic Design
🐧

Linux Mastery Curriculum

85 chapters · 23 phases · Kernel to UNIX Power Tools

A hardcore 85-part curriculum synthesized from 34 professional-grade textbooks (including Systems Performance, BPF Performance Tools, TLPI, and Advanced Programming in the UNIX Environment). Spans 23 phases: Phases 1-3 cover Foundations, Sysadmin automation, and Ring-0 Kernel Systems Programming natively in C. Phase 4 is an Extreme Observability capstone using eBPF and the USE Method. Phases 5-11 dive deeply into The UNIX Programming Interface (TLPI), Containerization Internals (Namespaces/Cgroups built from scratch), VFS & Memory Management (Page Caches/mmap), Kernel Extension (FUSE/Netfilter), Security (SELinux/Seccomp), and High-Performance Networking (XDP/DPDK kernel-bypass). Phases 12-16 handle Production Engineering, DNS/Web administration, and deep CLI text processing (Sed/Awk). Phases 17-21 scale to Cybersecurity Operations, Digital Forensics, TCP/IP Deep Dives, and Brenda Gregg's Performance analysis (Flame Graphs/Latency). Finally, Phases 22-23 culminate in Software Dynamics, Latency Tracing, and absolute mastery of the GNU Build Ecosystem (Autotools & Make) and UNIX Power Tools. Packed with heavily hands-on, securely isolated execution exercises.

LinuxKernel DevelopmenteBPFSystems ProgrammingCybersecurityPerformanceDocker InternalsCBashGNU Autotools