Deploying Large Language Models on AKS with Kaito
Today, we are going to look at deploying a large language model (LLM) directly into your Azure Kubernetes Service (AKS) cluster, running on GPU-enabled nodes, using the Kubernetes AI Toolchain Operator (KAITO).
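Before KAITO can deploy anything, the operator itself needs to be running in the cluster. On AKS it is available as a managed add-on enabled through the Azure CLI; a minimal sketch, assuming an existing cluster (the resource group and cluster names below are placeholders, and the add-on also expects the OIDC issuer to be enabled):

```shell
# Enable the managed KAITO (AI toolchain operator) add-on on an
# existing AKS cluster; names are placeholders for your own resources
az aks update \
  --resource-group myResourceGroup \
  --name myAKSCluster \
  --enable-oidc-issuer \
  --enable-ai-toolchain-operator
```

Alternatively, KAITO can be installed as a standalone operator via its Helm charts if you prefer not to use the managed add-on.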
KAITO is an open-source Kubernetes operator that streamlines deploying AI models on Kubernetes. It automates the critical tasks around model hosting: provisioning GPU infrastructure, selecting a suitable VM size for the model you choose, and pulling a preset, pre-tuned model image onto that hardware. By eliminating the manual setup steps (GPU node pools, drivers, model servers), KAITO shortens deployment time and reduces the cost of getting an LLM serving endpoint running on AKS.
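Concretely, you describe the deployment declaratively as a KAITO `Workspace` custom resource that pairs a GPU VM size with a preset model. A minimal sketch based on KAITO's published falcon-7b example (the workspace name and instance type are illustrative; adjust them to your model and GPU quota):

```yaml
apiVersion: kaito.sh/v1alpha1
kind: Workspace
metadata:
  name: workspace-falcon-7b
resource:
  instanceType: "Standard_NC12s_v3"   # GPU VM size KAITO will provision
  labelSelector:
    matchLabels:
      apps: falcon-7b
inference:
  preset:
    name: "falcon-7b"                 # preset model supported by KAITO
```

Applying this manifest with `kubectl apply` hands the rest to KAITO: it provisions the GPU node, deploys the model's inference runtime, and exposes a service you can send prompts to inside the cluster.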