Search Results for “name” – Page 18 – C4: Container, Code, Cloud & Context

AKS Workload Identity

Posted on April 17, 2025

AKS workload identity is a feature of Azure Kubernetes Service (AKS) that lets you use Azure Active Directory (AAD) to manage access to Azure resources from within a Kubernetes cluster. In this blog post, we’ll explore how AKS workload identity works and how to use it, with example code. How does AKS workload […]

LlamaIndex: The Data Framework for Building Production RAG Applications

Posted on April 15, 2025

Introduction: LlamaIndex (formerly GPT Index) is the leading data framework for building LLM applications over your private data. While LangChain focuses on chains and agents, LlamaIndex specializes in data ingestion, indexing, and retrieval—the core components of Retrieval Augmented Generation (RAG). With over 160 data connectors through LlamaHub, sophisticated indexing strategies, and production-ready query engines, LlamaIndex […]

Function Calling Deep Dive: Building LLM-Powered Tools and Agents

Posted on April 15, 2025

Introduction: Function calling transforms LLMs from text generators into action-taking agents. Instead of just describing what to do, the model can actually do it—query databases, call APIs, execute code, and interact with external systems. OpenAI’s function calling (now called “tools”) and similar features from Anthropic and others let you define available functions, and the model […]

Quantization Methods for LLMs: GPTQ, AWQ, and BitsAndBytes

Posted on April 8, 2025

Last year, I needed to run a 13B parameter model on a 16GB GPU. Full precision required 52GB. After testing GPTQ, AWQ, and BitsAndBytes, I reduced memory to 7GB with minimal accuracy loss. After quantizing 30+ models, I’ve learned which method works best for each scenario. Here’s the complete guide to LLM quantization. Figure 1: […]
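The 52GB figure in the excerpt above can be checked with a rough back-of-the-envelope calculation. This is only a sketch: it assumes "full precision" means 32-bit floats (4 bytes per parameter), counts weights only, and uses decimal gigabytes. The helper name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for model weights only, in decimal gigabytes."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 1e9


params = 13e9  # 13B parameters, as in the excerpt

fp32_gb = weight_memory_gb(params, 32)  # assumed meaning of "full precision"
fp16_gb = weight_memory_gb(params, 16)
int4_gb = weight_memory_gb(params, 4)   # 4-bit quantized weights

print(f"FP32: {fp32_gb:.1f} GB, FP16: {fp16_gb:.1f} GB, 4-bit: {int4_gb:.1f} GB")
```

Under these assumptions the 4-bit weights alone come to about 6.5GB, which is in the same range as the 7GB the post reports. The excerpt does not say where the remainder comes from; quantization metadata and runtime buffers are plausible candidates.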

AKS pod managed identity

Posted on April 7, 2025

Kubernetes has become one of the most popular container orchestration tools, and Azure Kubernetes Service (AKS) is a managed Kubernetes service provided by Microsoft Azure. With the increasing use of Kubernetes and AKS, there is a growing need to improve the security and management of access to cloud resources. AKS pod managed identity is a […]

Advanced RAG Patterns: From Naive Retrieval to Production-Grade Systems (Part 1 of 2)

Posted on April 7, 2025

Introduction: Retrieval-Augmented Generation (RAG) has become the go-to architecture for building LLM applications that need access to private or current information. By retrieving relevant documents and including them in the prompt, RAG grounds LLM responses in factual content, reducing hallucinations and enabling knowledge that wasn’t in the training data. But naive RAG implementations often disappoint—the […]
