Manage and increase quotas for resources with Azure AI Studio - Azure AI Studio (2024)

  • Article

Important

Some of the features described in this article might only be available in preview. This preview is provided without a service-level agreement, and we don't recommend it for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.

Quota provides the flexibility to actively manage the allocation of rate limits across the deployments within your subscription. This article walks through the process of managing quota for your Azure AI Studio virtual machines and Azure OpenAI models.

Azure uses limits and quotas to prevent budget overruns due to fraud, and to honor Azure capacity constraints. It's also a good way to control costs for admins. Consider these limits as you scale for production workloads.

In this article, you learn about:

  • Default limits on Azure resources
  • Creating Azure AI Studio hub-level quotas.
  • Viewing your quotas and limits
  • Requesting quota and limit increases

Special considerations

Quotas are applied to each subscription in your account. If you have multiple subscriptions, you must request a quota increase for each subscription.

A quota is a credit limit on Azure resources, not a capacity guarantee. If you have large-scale capacity needs, contact Azure support to increase your quota.

Note

Azure AI Studio compute has a separate quota from the core compute quota.

Default limits vary by offer category type, such as free trial, pay-as-you-go, and virtual machine (VM) series (such as Dv2, F, and G).

Azure AI Studio quota

The following actions in Azure AI Studio consume quota:

  • Creating a compute instance.
  • Building a vector index.
  • Deploying open models from model catalog.

Azure AI Studio compute

Azure AI Studio compute has a default quota limit on both the number of cores and the number of unique compute resources that are allowed per region in a subscription.

  • The quota on the number of cores is split by each VM Family and cumulative total cores.
  • The quota on the number of unique compute resources per region is separate from the VM core quota, as it applies only to the managed compute resources

To raise the limits for compute, you can request a quota increase in the Azure AI Studio.

Available resources include:

  • Dedicated cores per region have a default limit of 24 to 300, depending on your subscription offer type. You can increase the number of dedicated cores per subscription for each VM family. Specialized VM families like NCv2, NCv3, or ND series start with a default of zero cores. GPUs also default to zero cores.
  • Total compute limit per region has a default limit of 500 per region within a given subscription and can be increased up to a maximum value of 2500 per region. This limit is shared between compute instances, and managed online endpoint deployments. A compute instance is considered a single-node cluster for quota purposes. In order to increase the total compute limit, open an online customer support request.

When opening the support request to increase the total compute limit, provide the following information:

  1. Select Technical for the issue type.

  2. Select the subscription that you want to increase the quota for.

  3. Select Machine Learning as the service type.

  4. Select the resource that you want to increase the quota for.

  5. In the Summary field, enter "Increase total compute limits"

  6. Select Compute instance the problem type and Quota as the problem subtype.

  7. Select Next.

  8. On the Additional details page, provide the subscription ID, region, new limit (between 500 and 2500), and business justification to increase the total compute limits for the region.

  9. Select Create to submit the support request ticket.

Azure AI Studio provides a pool of shared quota that is available for different users across various regions to use concurrently. Depending upon availability, users can temporarily access quota from the shared pool, and use the quota to perform testing for a limited amount of time. The specific time duration depends on the use case. By temporarily using quota from the quota pool, you no longer need to file a support ticket for a short-term quota increase or wait for your quota request to be approved before you can proceed with your workload.

Use of the shared quota pool is available for testing inferencing for Llama-2, Phi, Nemotron, Mistral, Dolly, and Deci-DeciLM models from the Model Catalog. You should use the shared quota only for creating temporary test endpoints, not production endpoints. For endpoints in production, you should request dedicated quota. Billing for shared quota is usage-based, just like billing for dedicated virtual machine families.

Container Instances

For more information, see Container Instances limits.

Storage

Azure Storage has a limit of 250 storage accounts per region, per subscription. This limit includes both Standard and Premium storage accounts.

View and request quotas in Azure AI Studio

Use quotas to manage compute target allocation between multiple Azure AI Studio hubs in the same subscription.

By default, all hubs share the same quota as the subscription-level quota for VM families. However, you can set a maximum quota for individual VM families for more granular cost control and governance on hubs in a subscription. Quotas for individual VM families let you share capacity and avoid resource contention issues.

  1. In Azure AI Studio, go to the Home page and select either Model quota or VM quota from the Management section.

  2. When you select Model quota, you can view the quota for the models in the selected Azure region. To request more quota, select the model and then select Request quota.

    • Use the Show all quota toggle to display all quota or only the currently allocated quota.
    • Use the Group by dropdown to group the list by Quota type, Region & Model, Quota type, Model & Region, or None. The None grouping displays a list of model deployments.
    • Expand the groupings to view information on specific model deployments. While viewing a model deployment, select the pencil icon in the Quota allocation column to edit the quota allocation for the model deployment.
    • Use the charts along the side of the page to view more details about quota usage. The charts are interactive; hovering over a section of the chart displays more information, and selecting the chart filters the list of models. Selecting the chart legend filters the data displayed in the chart.
    • Use the Azure OpenAI Provisioned link to view information about provisioned models, including a Capacity calculator.

  3. When you select VM quota, you can view the quota and usage for the virtual machine families in the selected Azure region. To request more quota, select the VM family and then select Request quota.

Next steps

  • Plan to manage costs
  • How to create compute
Manage and increase quotas for resources with Azure AI Studio - Azure AI Studio (2024)
Top Articles
how to schedule VPN gateway? - Microsoft Q&A
Static classes and static class members in C# explained
Craigslist Monterrey Ca
Dte Outage Map Woodhaven
Arkansas Gazette Sudoku
Kraziithegreat
What Happened To Dr Ray On Dr Pol
Georgia Vehicle Registration Fees Calculator
Klustron 9
Holly Ranch Aussie Farm
CA Kapil 🇦🇪 Talreja Dubai on LinkedIn: #businessethics #audit #pwc #evergrande #talrejaandtalreja #businesssetup…
New Day Usa Blonde Spokeswoman 2022
83600 Block Of 11Th Street East Palmdale Ca
Tugboat Information
Mycarolinas Login
Cincinnati Bearcats roll to 66-13 win over Eastern Kentucky in season-opener
Johnston v. State, 2023 MT 20
8 Ways to Make a Friend Feel Special on Valentine's Day
Guilford County | NCpedia
Used Drum Kits Ebay
House Of Budz Michigan
065106619
Bend Pets Craigslist
Curry Ford Accident Today
Lola Bunny R34 Gif
Culver's Flavor Of The Day Taylor Dr
Bekijk ons gevarieerde aanbod occasions in Oss.
Www.publicsurplus.com Motor Pool
Yisd Home Access Center
Plaza Bonita Sycuan Bus Schedule
R&S Auto Lockridge Iowa
27 Fantastic Things to do in Lynchburg, Virginia - Happy To Be Virginia
Tomb Of The Mask Unblocked Games World
Rainfall Map Oklahoma
Storelink Afs
Angela Muto Ronnie's Mom
Gerber Federal Credit
Tributes flow for Soundgarden singer Chris Cornell as cause of death revealed
Exploring The Whimsical World Of JellybeansBrains Only
Ducky Mcshweeney's Reviews
AsROck Q1900B ITX und Ramverträglichkeit
New Gold Lee
Bimmerpost version for Porsche forum?
The Realreal Temporary Closure
Craigslist Farm And Garden Reading Pa
Lady Nagant Funko Pop
Love Words Starting with P (With Definition)
Az Unblocked Games: Complete with ease | airSlate SignNow
Po Box 101584 Nashville Tn
Mynord
Rocket Lab hiring Integration & Test Engineer I/II in Long Beach, CA | LinkedIn
Scott Surratt Salary
Latest Posts
Article information

Author: Aracelis Kilback

Last Updated:

Views: 6192

Rating: 4.3 / 5 (64 voted)

Reviews: 95% of readers found this page helpful

Author information

Name: Aracelis Kilback

Birthday: 1994-11-22

Address: Apt. 895 30151 Green Plain, Lake Mariela, RI 98141

Phone: +5992291857476

Job: Legal Officer

Hobby: LARPing, role-playing games, Slacklining, Reading, Inline skating, Brazilian jiu-jitsu, Dance

Introduction: My name is Aracelis Kilback, I am a nice, gentle, agreeable, joyous, attractive, combative, gifted person who loves writing and wants to share my knowledge and understanding with you.