Ask The Expert: “How can we optimize Microsoft Teams performance in our on-prem Parallels RAS deployment with mixed GPU and CPU-only session hosts?”

Août 12, 2025 by Moin Khan

Q: We’re experiencing performance issues with Microsoft Teams in our Parallels Remote Application Server (RAS) environment. Users are complaining about poor audio/video quality and sluggish performance. What’s the best approach to optimize Teams for our on-premises RAS deployment? Scenario: We’re deploying Microsoft Teams across our Parallels RAS infrastructure. Half our session hosts have NVIDIA RTX A4000 cards, while the other half are CPU-only. How should we approach Teams optimization differently for each scenario, and what architecture considerations should we account for?

Architecture Foundation

When deploying Teams in a Parallels RAS environment, the key is understanding that you’re dealing with a multi-layered virtualization stack. Teams needs to efficiently traverse from the endpoint device through the RAS infrastructure to the hosted session, then back again—all while maintaining real-time performance for media workloads.

The presence of GPU hardware fundamentally changes how Teams processes media workloads in your RAS environment. Here’s what you need to architect for:

GPU-Enabled Hosts:

Hardware-accelerated video encoding/decoding
Offloaded media processing from CPU
Higher concurrent user density potential
Different resource allocation patterns

CPU-Only Hosts:

Software-based media processing
CPU-intensive Teams workloads
More conservative concurrent user limits
Traditional resource scaling patterns

Scenario 1: GPU-Enabled RAS Hosts (NVIDIA RTX A4000)

Architecture Considerations

With GPU acceleration, your Teams deployment can handle significantly higher concurrent video sessions. Plan for:

Concurrent video users: 25-30 per host (vs 15-20 CPU-only)
GPU memory allocation: Reserve 2-4GB for Teams workloads
vGPU profiles: Configure appropriate GRID profiles if using virtualized GPUs

Group Policy Configuration

GPU-Specific Teams Settings:

HKLM\SOFTWARE\Microsoft\Teams\IsVirtualDesktopOptimized = 1 (DWORD)

HKLM\SOFTWARE\Microsoft\Teams\DisableHWAcceleration = 0 (DWORD)

HKLM\SOFTWARE\Microsoft\Teams\DisableGpuAcceleration = 0 (DWORD)

RAS Multimedia Policies:

Enable hardware-accelerated media redirection
Configure GPU passthrough for Teams processes
Set multimedia bandwidth limits per session: 5-8 Mbps

Resource Allocation

GPU Memory Management:

Allocate 512MB-1GB GPU memory per concurrent video session
Monitor GPU utilization targeting <80% peak usage
Configure GPU memory oversubscription if using vGPU

System Resources:

CPU: 2-3% per concurrent Teams session (reduced from CPU-only)
RAM: 400-500MB per active Teams user
Storage IOPS: 25-30 per user (GPU cache requirements)

Scenario 2: CPU-Only RAS Hosts

Architecture Considerations

Without GPU acceleration, focus on CPU optimization and more conservative scaling:

Concurrent video users: 12-15 per host maximum
CPU core allocation: Minimum 2 cores per 8 concurrent users
Memory over-allocation: Essential for software video processing

Group Policy Configuration

CPU-Optimized Teams Settings:

HKLM\SOFTWARE\Microsoft\Teams\IsVirtualDesktopOptimized = 1 (DWORD)

HKLM\SOFTWARE\Microsoft\Teams\DisableHWAcceleration = 1 (DWORD)

HKLM\SOFTWARE\Microsoft\Teams\DisableGpuAcceleration = 1 (DWORD)

Performance Optimization Policies:

Set process priority for Teams.exe to “Above Normal”
Configure CPU affinity to dedicate cores for media processing
Implement aggressive Windows Search exclusions

Resource Allocation

CPU Management:

CPU: 8-12% per concurrent Teams video session
Reserve 4 CPU cores minimum for Teams workloads
Monitor CPU utilization targeting <70% peak usage

Memory Allocation:

RAM: 600-800MB per active Teams user
Configure page file on SSD for overflow
Implement memory compression if available

QoS Strategy: Universal Best Practices

Regardless of GPU presence, implement consistent QoS policies:

Traffic Classification:

Audio (DSCP 46):

├── Guaranteed: 100 kbps per session

├── Latency: <150ms

└── Loss tolerance: <0.1%

Video (DSCP 34):

├── GPU hosts: Up to 2.5 Mbps per session

├── CPU hosts: Up to 1.8 Mbps per session

├── Latency: <400ms

└── Burst allowance: 2x guaranteed rate

OS Optimization by Scenario

GPU Host Optimizations

NVIDIA-Specific:

Install latest GRID/RTX drivers with Teams certification
Configure TCC mode for datacenter cards
Implement GPU monitoring via nvidia-smi

Windows Configuration:

Enable Hardware-accelerated GPU scheduling
Configure Graphics performance preference for Teams
Set power management to maximum performance

CPU Host Optimizations

Processor Configuration:

Disable CPU power management/C-states
Enable all processor cores for scheduling
Configure processor affinity for Teams processes

System Tuning:

Increase system working set size
Configure aggressive prefetch settings
Implement CPU priority boosting for media workloads

Performance Benchmarks

GPU-Enabled Targets

Concurrent 720p video sessions: 25-30 per host
GPU utilization: <80% peak
CPU utilization: <60% peak
Video quality score: >4.2/5

CPU-Only Targets

Concurrent 720p video sessions: 12-15 per host
CPU utilization: <70% peak
Memory utilization: <85% peak
Video quality score: >3.8/5

Load Balancing Strategy

Intelligent Workload Distribution:

Priority routing: Direct video-heavy users to GPU hosts
Fallback logic: CPU hosts handle overflow and audio-only sessions
Resource monitoring: Real-time redirection based on utilization
User experience: Maintain session affinity once established

Monitoring KPIs by Configuration

GPU Host Monitoring

GPU memory utilization and temperature
CUDA core usage patterns
Video encoding/decoding throughput
Teams’ hardware acceleration status

CPU Host Monitoring

Per-core CPU utilization during Teams calls
Memory allocation and page file usage
Context switching rates during peak usage
Teams’ software rendering performance

Le bilan

Your dual-configuration approach requires thoughtful architecture but delivers significant benefits. GPU hosts can support 60-70% more concurrent video users while providing better quality. CPU hosts, when properly optimized, still deliver solid performance for mixed workloads.

Key Success Factors:

Differentiated policies: Configure Teams settings specific to each host type
Intelligent routing: Direct appropriate workloads to optimal hardware
Resource monitoring: Track GPU and CPU metrics separately
User experience: Maintain consistent performance regardless of backend assignment

The investment in GPU acceleration typically pays for itself through improved user density and experience—just ensure your architecture takes full advantage of both configurations.

Got complex architecture scenarios? Let’s solve them together.

Partager cet article

French

À propos de nous

Partenaires

Prix

XenTegra se soucie

Qui sommes-nous ?

Espaces de travail virtuels

Gestion des services informatiques

Applications

Solutions de réseau

Point final

Suivi et analyse

Sécurité gérée

Reprise après sinistre en tant que service (DRaaS)

Desktop as a Service (DaaS)

Nuage public

Nuage privé hébergé Nutanix

Infrastructure en tant que service

Reprise après sinistre en tant que service (DRaaS)

Desktop as a Service (DaaS)

Services de colocation

Nuage public

Cloud hyperscale

Surveillance de l'infrastructure

Sauvegarde et récupération

Infrastructure hyperconvergée

Nutanix

Matériel de serveur

Cohésion

Nuage Azure

Amazon Web Services (AWS)

Services professionnels

Solutions Windows et Microsoft 365

Infrastructure de bureau virtuel (VDI)

Contrôle

Points finaux

Omnissa

Citrix

Microsoft

Bureau virtuel Azure (AVD)

Services de conseil en IA

ServiceNow

XenTegra récompensé en tant que partenaire revendeur ServiceNow de l'année 2024 et finaliste en tant que partenaire de conseil et de mise en œuvre

Salesforce

L'équipe violette persévérante

Détection et réponse gérées (MDR)

Test de pénétration

Gestion de la cyber-exposition (CEM)

Sécurité de l'informatique en nuage

Protection du courrier électronique

Infrastructure de réseau

SD-WAN

SASE (Secure Access Service Edge)

Pare-feu

Micro-segmentation

Tous les services d'appui

Hybrid Support

Support Citrix

ServiceNow Support

Support IGEL

Support Fortinet

Soutien au Tanium

Support Nutanix

Support PrinterLogic

Support LogicMonitor

Support Microsoft

Secteur public

Petites et moyennes entreprises

Personnel

Contrats d'engagement programmé

La gestion de projet en tant que service

Ce que nous faisons

Évaluations gratuites

Actualités

Blogs

Podcasts

Réductions sur les cartes d'accès à la conférence

Formation à prix réduit

Carrières

Ressources

Ask The Expert: “How can we optimize Microsoft Teams performance in our on-prem Parallels RAS deployment with mixed GPU and CPU-only session hosts?”