Room Acoustics Basics — AV System Design Considerations
Room acoustics is the single most important factor in AV system audio performance that is not directly controlled by the AV system. A microphone array with excellent AEC cannot compensate for a reverberant conference room. A high-quality loudspeaker system sounds unintelligible in a gymnasium with 4-second reverberation. AV system designers must understand acoustics well enough to specify acoustic requirements, recognize problem spaces, and recommend treatment before (not after) system installation.
Reverberation Time — RT60
RT60 is the time it takes for a sound to decay by 60 dB after the source stops. RT60 is the most fundamental acoustic measurement for AV system design. Lower RT60 means a "drier" room; higher RT60 means a "livelier" or more reverberant room.
RT60 targets by application:
| Space Type | Target RT60 | Notes |
|---|---|---|
| Small conference room (< 300 sq ft) | 0.3–0.5 s | AEC, beamforming mics require ≤ 0.5 s |
| Medium conference room (300–1000 sq ft) | 0.4–0.6 s | AEC performance degrades above 0.6 s |
| Boardroom / executive suite | 0.4–0.6 s | Speech clarity critical; treat reflective surfaces |
| Training room / classroom | 0.5–0.7 s | Balance speech intelligibility and liveness |
| Lecture hall / auditorium | 0.8–1.2 s | Reinforced speech; AV system compensates with delay |
| House of worship | 1.2–2.5 s | Music requires longer decay; speech intelligibility suffers |
| Recording studio (control room) | 0.2–0.3 s | Near-anechoic; critical listening |
| Gymnasium / open warehouse | 2.5–5 s+ | AV sound reinforcement extremely difficult |
RT60 is measured using a calibrated SPL meter with fractional-octave analysis. Measurements are typically taken at 500 Hz and 1 kHz (mid-frequency RT60) as the reference values for speech applications. Low-frequency RT60 (125 Hz, 250 Hz) is usually longer and harder to treat.
Early Reflections and Late Reverberation
Sound from a loudspeaker (or talker) reaches a listener both directly and via reflections off room surfaces. These two components behave differently:
Direct sound — Travels straight from source to listener. First to arrive; carries the primary intelligibility cues.
Early reflections — Arrive within 50 ms of the direct sound. At small delay offsets (< 30 ms), early reflections actually improve intelligibility by reinforcing the direct sound (the Haas effect). Reflections arriving between 30–50 ms with high level begin to cause comb filtering (coloration) and slight clarity degradation.
Late reverberation — Arrives more than 50 ms after direct sound. This is the "tail" that fills the room after a sound ends. High late reverberation is the primary cause of poor speech intelligibility in reverberant spaces. The critical distinction: early reflections are your friend; late reverberation is your enemy in speech applications.
Room Modes
Room modes (resonances) are standing waves that develop between parallel surfaces at specific frequencies. A room mode causes some frequencies to be dramatically louder or quieter depending on where in the room you stand.
The fundamental frequency of a room mode is: f = 1125 / (2 × dimension in feet)
A 20-foot room has a fundamental axial mode at 28 Hz. The 2nd harmonic is at 56 Hz, 3rd at 84 Hz, etc. Multiple modes accumulate at low frequencies, causing the "boomy" quality characteristic of untreated rooms.
Axial modes — Between two parallel surfaces (floor/ceiling, two walls). Most energetic. Tangential modes — Involve four surfaces. Weaker than axial by ~3 dB. Oblique modes — Involve all six surfaces. Weakest.
Room modes primarily affect frequencies below 300 Hz. Above this range, modal density becomes high enough that room behavior becomes more diffuse. For AV system design, low-frequency EQ to address room modes must be made with measurement — flat EQ curves do not correct in-room response.
Absorption — Materials and Coefficients
Acoustic absorption converts sound energy to heat via frictional loss in porous materials. Each material has an absorption coefficient (α) from 0 (perfectly reflective) to 1.0 (perfectly absorptive) at each frequency.
Common acoustic materials:
| Material | 125 Hz | 500 Hz | 2 kHz | Notes |
|---|---|---|---|---|
| Bare concrete | 0.01 | 0.02 | 0.04 | Essentially reflective at all frequencies |
| Carpet on concrete | 0.02 | 0.14 | 0.37 | Good high-frequency absorption; minimal LF |
| Acoustic ceiling tile (standard) | 0.10 | 0.70 | 0.80 | Poor LF; good MF/HF |
| 2" fiberglass board | 0.17 | 0.91 | 0.99 | Good across MF/HF; modest LF |
| 4" fiberglass board | 0.43 | 0.99 | 0.99 | Better LF than 2"; most used wall treatment |
| Bass trap (thick mineral wool) | 0.65 | 0.99 | 0.99 | Designed for LF absorption |
| Upholstered seating (occupied) | 0.44 | 0.60 | 0.62 | Significant absorption; unoccupied rooms sound different |
| Glass / drywall | 0.05 | 0.04 | 0.03 | Reflective; slight LF absorption due to vibration |
Key insight from the table: Most common materials (carpet, ceiling tile) are effective absorbers at mid-to-high frequencies but largely transparent to low frequencies. Controlling bass reverberation requires purpose-built bass traps with thick material (4"+) or air gaps.
Acoustic Treatment Products
Fabric-wrapped panels — Rigid fiberglass or mineral wool (2-4") wrapped in acoustically transparent fabric. Placed on walls to absorb mid and high frequencies. Most common treatment in conference rooms and boardrooms.
Bass traps — Thick absorptive material (4-8" rockwool, specialized membrane absorbers) placed in corners where bass energy accumulates. Corners have 2–3x the pressure of open wall surfaces, making them the most effective placement for low-frequency absorption.
Diffusers — Irregular surface profiles (QRD — Quadratic Residue Diffusers, skyline diffusers) that scatter sound in multiple directions without absorbing it. Diffusion maintains a sense of spaciousness while reducing flutter echo. Used in music rehearsal rooms, recording studios, and spaces where excessive absorption would sound "dead."
Ceiling clouds — Horizontal absorptive panels suspended from the ceiling above the listening/conferencing area. Highly effective at reducing the first ceiling reflection, which is the dominant early reflection in most conference rooms.
Underlay and floor treatment — Carpet is the most cost-effective acoustic treatment in a conference room. Hard-floor conference rooms have significantly higher RT60 and first-reflection energy.
HVAC Noise — NC Ratings
Background noise from HVAC systems is the second major acoustic factor affecting AV system performance. The NC (Noise Criteria) rating describes the level and spectrum of HVAC noise.
NC ratings for AV applications:
| NC Rating | Sound Level (approx.) | Appropriate Spaces |
|---|---|---|
| NC-20 | ~28 dBA | Recording studios, critical listening |
| NC-25 | ~33 dBA | Executive boardrooms, high-quality conferencing |
| NC-30 | ~38 dBA | Recommended for conferencing; maximum for AEC performance |
| NC-35 | ~42 dBA | Acceptable for presentation; conferencing performance degrades |
| NC-40 | ~47 dBA | Training rooms, open offices; microphone noise floor becomes audible |
| NC-45 | ~52 dBA | Many existing office buildings; significant AEC and gain limitation |
| NC-50+ | ~57 dBA+ | Industrial spaces; sound reinforcement extremely difficult |
AEC algorithms require the acoustic echo signal to be significantly above the noise floor. At NC-40+, microphone signal-to-noise ratio is reduced enough that AEC performance degrades noticeably. The target for rooms with AEC-based conferencing is NC-30 or lower.
Common HVAC noise sources: supply air grilles with excessive velocity (fix: low-velocity diffusers, larger duct openings), return air grilles in adjacent walls (fix: relocate, use sound boots), variable air volume boxes (fix: duct lining, silencers).
Speech Intelligibility — STI
STI (Speech Transmission Index) is a standardized measurement of how intelligible speech is in a room, accounting for both reverberation and noise. STI ranges from 0 (unintelligible) to 1.0 (perfect).
| STI Value | Rating | Conferencing Suitability |
|---|---|---|
| 0.75–1.0 | Excellent | Ideal |
| 0.60–0.75 | Good | Acceptable for most conferencing |
| 0.45–0.60 | Fair | Marginal; acoustic treatment recommended |
| 0.30–0.45 | Poor | Conferencing difficult; treatment required |
| < 0.30 | Bad | Intelligibility insufficient; major intervention needed |
STI is measured with a calibrated measurement system (Rational Acoustics Smaart, IVIE Technologies, Norsonic) using a stimulus signal. STI values below 0.60 in a space being evaluated for conferencing should trigger a recommendation for acoustic treatment before AV system installation.
Common Pitfalls
- Installing AV before acoustic treatment — AEC, beamforming mics, and speaker tuning cannot compensate for RT60 above 0.8 seconds. If the room requires acoustic treatment, treatment must precede or accompany the AV installation.
- Underestimating HVAC noise — Many conference rooms have NC-40 or higher HVAC noise due to undersized ductwork or supply grilles. Measure NC before specifying microphone sensitivity and AEC configuration.
- Treating only high frequencies — Standard wall panels and ceiling tiles address MF/HF reverberation well. LF reverberation (low rumble quality) requires bass traps. A room with abundant acoustic panels but no LF treatment often has a "hollow" quality.
- Assuming occupied room equals measured room — An unoccupied conference room with hard chairs has significantly different RT60 than when the same room has 10 people in upholstered seats. Measure under both conditions when possible, and design for occupied use.
- Ignoring flutter echo — Parallel hard surfaces (two glass walls, hard ceiling and hard floor) create flutter echo — a rapid, metallic ping heard after hand claps or sharp sounds. Flutter echo is distinct from general reverberation and requires diffusion or absorption on one of the two parallel surfaces.