Education

Room Acoustics Basics — AV System Design Considerations

Room acoustics is the single most important factor in AV system audio performance that is not directly controlled by the AV system. A microphone array with excellent AEC cannot compensate for a reverberant conference room. A high-quality loudspeaker system sounds unintelligible in a gymnasium with 4-second reverberation. AV system designers must understand acoustics well enough to specify acoustic requirements, recognize problem spaces, and recommend treatment before (not after) system installation.

Reverberation Time — RT60

RT60 is the time it takes for a sound to decay by 60 dB after the source stops. RT60 is the most fundamental acoustic measurement for AV system design. Lower RT60 means a "drier" room; higher RT60 means a "livelier" or more reverberant room.

RT60 targets by application:

Space TypeTarget RT60Notes
Small conference room (< 300 sq ft)0.3–0.5 sAEC, beamforming mics require ≤ 0.5 s
Medium conference room (300–1000 sq ft)0.4–0.6 sAEC performance degrades above 0.6 s
Boardroom / executive suite0.4–0.6 sSpeech clarity critical; treat reflective surfaces
Training room / classroom0.5–0.7 sBalance speech intelligibility and liveness
Lecture hall / auditorium0.8–1.2 sReinforced speech; AV system compensates with delay
House of worship1.2–2.5 sMusic requires longer decay; speech intelligibility suffers
Recording studio (control room)0.2–0.3 sNear-anechoic; critical listening
Gymnasium / open warehouse2.5–5 s+AV sound reinforcement extremely difficult

RT60 is measured using a calibrated SPL meter with fractional-octave analysis. Measurements are typically taken at 500 Hz and 1 kHz (mid-frequency RT60) as the reference values for speech applications. Low-frequency RT60 (125 Hz, 250 Hz) is usually longer and harder to treat.

Early Reflections and Late Reverberation

Sound from a loudspeaker (or talker) reaches a listener both directly and via reflections off room surfaces. These two components behave differently:

Direct sound — Travels straight from source to listener. First to arrive; carries the primary intelligibility cues.

Early reflections — Arrive within 50 ms of the direct sound. At small delay offsets (< 30 ms), early reflections actually improve intelligibility by reinforcing the direct sound (the Haas effect). Reflections arriving between 30–50 ms with high level begin to cause comb filtering (coloration) and slight clarity degradation.

Late reverberation — Arrives more than 50 ms after direct sound. This is the "tail" that fills the room after a sound ends. High late reverberation is the primary cause of poor speech intelligibility in reverberant spaces. The critical distinction: early reflections are your friend; late reverberation is your enemy in speech applications.

Room Modes

Room modes (resonances) are standing waves that develop between parallel surfaces at specific frequencies. A room mode causes some frequencies to be dramatically louder or quieter depending on where in the room you stand.

The fundamental frequency of a room mode is: f = 1125 / (2 × dimension in feet)

A 20-foot room has a fundamental axial mode at 28 Hz. The 2nd harmonic is at 56 Hz, 3rd at 84 Hz, etc. Multiple modes accumulate at low frequencies, causing the "boomy" quality characteristic of untreated rooms.

Axial modes — Between two parallel surfaces (floor/ceiling, two walls). Most energetic. Tangential modes — Involve four surfaces. Weaker than axial by ~3 dB. Oblique modes — Involve all six surfaces. Weakest.

Room modes primarily affect frequencies below 300 Hz. Above this range, modal density becomes high enough that room behavior becomes more diffuse. For AV system design, low-frequency EQ to address room modes must be made with measurement — flat EQ curves do not correct in-room response.

Absorption — Materials and Coefficients

Acoustic absorption converts sound energy to heat via frictional loss in porous materials. Each material has an absorption coefficient (α) from 0 (perfectly reflective) to 1.0 (perfectly absorptive) at each frequency.

Common acoustic materials:

Material125 Hz500 Hz2 kHzNotes
Bare concrete0.010.020.04Essentially reflective at all frequencies
Carpet on concrete0.020.140.37Good high-frequency absorption; minimal LF
Acoustic ceiling tile (standard)0.100.700.80Poor LF; good MF/HF
2" fiberglass board0.170.910.99Good across MF/HF; modest LF
4" fiberglass board0.430.990.99Better LF than 2"; most used wall treatment
Bass trap (thick mineral wool)0.650.990.99Designed for LF absorption
Upholstered seating (occupied)0.440.600.62Significant absorption; unoccupied rooms sound different
Glass / drywall0.050.040.03Reflective; slight LF absorption due to vibration

Key insight from the table: Most common materials (carpet, ceiling tile) are effective absorbers at mid-to-high frequencies but largely transparent to low frequencies. Controlling bass reverberation requires purpose-built bass traps with thick material (4"+) or air gaps.

Acoustic Treatment Products

Fabric-wrapped panels — Rigid fiberglass or mineral wool (2-4") wrapped in acoustically transparent fabric. Placed on walls to absorb mid and high frequencies. Most common treatment in conference rooms and boardrooms.

Bass traps — Thick absorptive material (4-8" rockwool, specialized membrane absorbers) placed in corners where bass energy accumulates. Corners have 2–3x the pressure of open wall surfaces, making them the most effective placement for low-frequency absorption.

Diffusers — Irregular surface profiles (QRD — Quadratic Residue Diffusers, skyline diffusers) that scatter sound in multiple directions without absorbing it. Diffusion maintains a sense of spaciousness while reducing flutter echo. Used in music rehearsal rooms, recording studios, and spaces where excessive absorption would sound "dead."

Ceiling clouds — Horizontal absorptive panels suspended from the ceiling above the listening/conferencing area. Highly effective at reducing the first ceiling reflection, which is the dominant early reflection in most conference rooms.

Underlay and floor treatment — Carpet is the most cost-effective acoustic treatment in a conference room. Hard-floor conference rooms have significantly higher RT60 and first-reflection energy.

HVAC Noise — NC Ratings

Background noise from HVAC systems is the second major acoustic factor affecting AV system performance. The NC (Noise Criteria) rating describes the level and spectrum of HVAC noise.

NC ratings for AV applications:

NC RatingSound Level (approx.)Appropriate Spaces
NC-20~28 dBARecording studios, critical listening
NC-25~33 dBAExecutive boardrooms, high-quality conferencing
NC-30~38 dBARecommended for conferencing; maximum for AEC performance
NC-35~42 dBAAcceptable for presentation; conferencing performance degrades
NC-40~47 dBATraining rooms, open offices; microphone noise floor becomes audible
NC-45~52 dBAMany existing office buildings; significant AEC and gain limitation
NC-50+~57 dBA+Industrial spaces; sound reinforcement extremely difficult

AEC algorithms require the acoustic echo signal to be significantly above the noise floor. At NC-40+, microphone signal-to-noise ratio is reduced enough that AEC performance degrades noticeably. The target for rooms with AEC-based conferencing is NC-30 or lower.

Common HVAC noise sources: supply air grilles with excessive velocity (fix: low-velocity diffusers, larger duct openings), return air grilles in adjacent walls (fix: relocate, use sound boots), variable air volume boxes (fix: duct lining, silencers).

Speech Intelligibility — STI

STI (Speech Transmission Index) is a standardized measurement of how intelligible speech is in a room, accounting for both reverberation and noise. STI ranges from 0 (unintelligible) to 1.0 (perfect).

STI ValueRatingConferencing Suitability
0.75–1.0ExcellentIdeal
0.60–0.75GoodAcceptable for most conferencing
0.45–0.60FairMarginal; acoustic treatment recommended
0.30–0.45PoorConferencing difficult; treatment required
< 0.30BadIntelligibility insufficient; major intervention needed

STI is measured with a calibrated measurement system (Rational Acoustics Smaart, IVIE Technologies, Norsonic) using a stimulus signal. STI values below 0.60 in a space being evaluated for conferencing should trigger a recommendation for acoustic treatment before AV system installation.

Common Pitfalls

  • Installing AV before acoustic treatment — AEC, beamforming mics, and speaker tuning cannot compensate for RT60 above 0.8 seconds. If the room requires acoustic treatment, treatment must precede or accompany the AV installation.
  • Underestimating HVAC noise — Many conference rooms have NC-40 or higher HVAC noise due to undersized ductwork or supply grilles. Measure NC before specifying microphone sensitivity and AEC configuration.
  • Treating only high frequencies — Standard wall panels and ceiling tiles address MF/HF reverberation well. LF reverberation (low rumble quality) requires bass traps. A room with abundant acoustic panels but no LF treatment often has a "hollow" quality.
  • Assuming occupied room equals measured room — An unoccupied conference room with hard chairs has significantly different RT60 than when the same room has 10 people in upholstered seats. Measure under both conditions when possible, and design for occupied use.
  • Ignoring flutter echo — Parallel hard surfaces (two glass walls, hard ceiling and hard floor) create flutter echo — a rapid, metallic ping heard after hand claps or sharp sounds. Flutter echo is distinct from general reverberation and requires diffusion or absorption on one of the two parallel surfaces.

Related

Continue reading in the knowledge base.

AEC — Acoustic Echo Cancellation

DSP algorithm that removes room reflections and far-end voice echo from microphone signals to enable full-duplex conferencing.

Open note →

DSP Fundamentals — Digital Signal Processing for AV

Technical foundations of digital signal processing in AV systems — ADC/DAC, filters, dynamics, delay, AEC, and matrix mixing as implemented in professional DSP hardware.

Open note →

DSP — Digital Signal Processing

Real-time digital manipulation of audio signals using algorithms running on dedicated processor hardware, enabling mixing, routing, EQ, dynamics, and conferencing functions in AV systems.

Open note →

Microphone Types — Transducers, Polar Patterns, and AV Applications

Professional microphone types, transducer technologies, polar patterns, key specifications, and application guidance for AV system designers.

Open note →

Speaker Placement — Coverage, Delay, and System Design

Practical guide to loudspeaker placement in AV systems — coverage geometry, delay alignment, distributed vs. point source systems, and common installation pitfalls.

Open note →

Signal Flow — AV System Signal Chain

Comprehensive guide to signal flow in AV systems — from microphone through DSP to amplifier, video distribution, conferencing bidirectional paths, and gain structure.

Open note →

UCC — Unified Communications and Collaboration

Integrated platforms combining video conferencing, voice, messaging, and content sharing for seamless hybrid communication — Teams, Zoom, Webex, and Google Meet.

Open note →

We use optional analytics cookies to understand site usage and improve the experience. You can accept or reject.