Visão Geral
Este curso aborda os fundamentos, arquitetura, implementação e operação de bancos de dados vetoriais utilizados em aplicações modernas de Inteligência Artificial. O participante aprenderá como armazenar, indexar e recuperar embeddings de forma eficiente para suportar casos de uso como Retrieval-Augmented Generation (RAG), busca semântica, sistemas de recomendação, agentes de IA e aplicações baseadas em Large Language Models (LLMs). O curso explora as principais plataformas do mercado, estratégias de indexação, otimização de consultas, escalabilidade, segurança e governança de dados vetoriais em ambientes corporativos.
Conteúdo Programatico
Module 1: Introduction to Vector Databases
- Evolution of data storage for AI
- Fundamentals of vector databases
- Vector search concepts
- AI and Generative AI use cases
- Benefits and limitations
- Vector database ecosystem overview
Module 2: Embeddings Fundamentals
- Understanding embeddings
- Text embeddings
- Image embeddings
- Embedding generation models
- Similarity and distance calculations
- Embedding quality evaluation
Module 3: Vector Mathematics and Similarity Search
- Vector space concepts
- Cosine similarity
- Euclidean distance
- Dot product similarity
- Nearest neighbor search
- Similarity ranking techniques
Module 4: Vector Indexing Techniques
- Indexing fundamentals
- Approximate Nearest Neighbor (ANN)
- HNSW indexing
- IVF indexing
- Product Quantization concepts
- Performance optimization strategies
Module 5: Popular Vector Database Platforms
- Pinecone architecture
- Weaviate fundamentals
- Milvus platform overview
- Qdrant architecture
- Chroma database concepts
- Platform comparison methodologies
Module 6: Data Ingestion and Management
- Embedding pipelines
- Data ingestion workflows
- Metadata management
- Data synchronization techniques
- Data lifecycle management
- Versioning strategies
Module 7: Semantic Search Architectures
- Semantic search fundamentals
- Enterprise search solutions
- Hybrid search architectures
- Query optimization techniques
- Search relevance improvement
- Search performance evaluation
Module 8: Vector Databases for RAG
- RAG architecture integration
- Knowledge retrieval workflows
- Context enrichment strategies
- Retrieval optimization techniques
- Grounded response generation
- Enterprise RAG use cases
Module 9: Scalability and Performance Engineering
- Distributed vector databases
- Horizontal scaling techniques
- Capacity planning
- Query performance optimization
- Resource utilization management
- Cost optimization strategies
Module 10: Security and Governance
- Data protection principles
- Access control mechanisms
- Multi-tenant architectures
- Compliance requirements
- Governance frameworks
- Audit and monitoring practices
Module 11: Enterprise Integration Patterns
- API integration strategies
- LLM integration architectures
- Agentic AI integration
- Data platform connectivity
- Cloud-native deployment patterns
- Enterprise architecture considerations
Module 12: Vector Databases for AI Workshop
- Embedding generation exercises
- Vector indexing laboratories
- Semantic search implementation
- RAG integration projects
- Performance tuning activities
- Final vector database solution project