System and Method for Continuous Road Surface Friction Coefficient Estimation Using Crowdsourced Vehicle Tire-Road Acoustic Emission, Wheel Speed Sensor Micro-Slip Analysis, and Self-Supervised Contrastive Learning for Real-Time Friction Mapping
Abstract
Disclosed is a system and method for continuously estimating road surface friction coefficients in real time by fusing two complementary signals already available in modern production vehicles: tire-road acoustic emission captured by cabin microphones (installed for active noise cancellation and voice assistants), and wheel speed sensor micro-slip ratios measured during normal straight-line and cornering maneuvers below the tire's saturation region. A self-supervised contrastive learning framework eliminates the need for labeled friction ground truth by exploiting a consistency constraint: multiple vehicles traversing the same road segment within a short temporal window should produce concordant friction estimates regardless of vehicle mass, tire compound, or speed. The system aggregates per-vehicle friction estimates into a spatiotemporal map tiled at 10-meter longitudinal resolution, updated at sub-minute latency, and served to navigation systems and autonomous driving stacks via a standardized API. Field-deployable without any new hardware on vehicles manufactured after 2020, the system converts the existing fleet into a distributed friction sensing network covering every road that vehicles drive on.
Field of the Invention
This invention relates to automotive safety and road condition monitoring, specifically to methods for estimating tire-road friction coefficients using passive acoustic sensing and vehicle dynamics telemetry, combined with machine learning techniques that require no labeled training data, to generate real-time crowdsourced friction maps at scale.
Background
Road surface friction is the single most important environmental variable governing vehicle stopping distance, cornering grip, and crash risk. The Federal Highway Administration estimates that wet pavement contributes to approximately 860,000 crashes annually in the United States, causing 4,700 fatalities and 338,000 injuries. Black ice alone accounts for roughly 1,300 deaths per year according to FHWA road weather data. Yet no production vehicle sold today can directly measure the friction coefficient of the road ahead.
Existing approaches to friction estimation fall into three categories, each with fundamental limitations:
- Slip-based estimation during braking or hard cornering: ABS and ESC systems estimate friction by measuring wheel slip during emergency maneuvers. US8626454B2 (Continental, 2014) discloses using ABS activation events. The problem is latency. You only learn the road is slippery after the driver has already lost traction. Acosta et al. (2020) surveyed these methods and confirmed that slip-based estimators perform well above 40% tire utilization but are unreliable under normal driving conditions.
- Dedicated friction measurement vehicles: The ASTM E274 locked-wheel skid trailer and the Grip Tester produce laboratory-grade friction measurements. Cost: $200,000-$500,000 per vehicle. Coverage: state DOTs typically test each lane-mile once every 2-3 years. A state like California with 51,000 lane-miles of highway might complete a full friction survey every 36 months, missing seasonal and weather-driven variations entirely.
- Camera and LiDAR-based surface classification: Roychowdhury et al. (IEEE T-ITS, 2021) demonstrated CNN-based road surface classification (dry, wet, snowy, icy) from forward-facing cameras. Tesla and Mobileye have deployed similar classifiers. But classification is not quantification. Knowing a road is "wet" does not distinguish a friction coefficient of 0.5 (light rain on good asphalt) from 0.25 (standing water on polished concrete). The mapping from visual class to friction coefficient is many-to-one and unreliable.
Two physical phenomena make passive friction estimation possible without triggering a slip event:
Tire-road acoustic emission. The contact patch between a tire and the road surface generates broadband noise through three mechanisms: air pumping in tread grooves (dominant at 800-2000 Hz), tread block snap-out (600-1200 Hz), and surface texture-induced vibration (200-1000 Hz). The spectral shape of this noise is a function of surface texture, surface wetness, rubber compound, and contact pressure. Crucially, surface texture and wetness are the dominant determinants of friction. Sandberg and Ejsmont (Applied Acoustics, 2018) showed that the 800-1600 Hz band correlates with macrotexture depth (R² = 0.82), and macrotexture depth correlates with friction at highway speeds (R² = 0.71-0.88) according to Rado (Transportation Research Record, 1996). The acoustic path from tire contact patch to cabin microphone passes through the wheel well, suspension, and vehicle body, but the spectral signature of the road surface is preserved because the vehicle's transfer function is approximately time-invariant for a given speed.
Wheel speed sensor micro-slip. Even during straight-line constant-speed driving, individual wheels exhibit small slip ratio fluctuations (on the order of 0.001-0.01) caused by road surface irregularities, tire non-uniformities, and minor steering corrections. Modern ABS wheel speed sensors resolve these fluctuations at 100 Hz with 0.1 km/h precision. The variance and spectral density of micro-slip fluctuations contain information about the tire operating point on the friction curve. On low-friction surfaces, the tire operates closer to its saturation slip ratio even under normal loads, producing characteristically different micro-slip dynamics. Cai et al. (2023) demonstrated cooperative friction estimation from wheel speed data, achieving R² = 0.76 on labeled test data.
The gap in the prior art is a complete system that: (a) fuses acoustic and micro-slip signals for friction estimation during normal driving without requiring excitation events, (b) eliminates the need for labeled friction ground truth through self-supervised learning, (c) aggregates estimates across a vehicle fleet to produce continuous spatiotemporal friction maps, and (d) operates on hardware already present in modern production vehicles.
Detailed Description
1. On-Vehicle Sensor Configuration
The system requires no hardware modifications to vehicles manufactured after approximately 2020. It uses three existing sensor subsystems. First, one or more cabin microphones already present for hands-free calling, voice assistants, and active noise cancellation (ANC). Vehicles with road noise cancellation (e.g., Bose QuietComfort Road Noise Control, Harman Road Active Noise Control) typically have 4-8 microphones sampling at 44.1 kHz or higher, positioned in the headliner, A-pillars, and kick panels. Even base-model vehicles increasingly include at least two microphones for voice recognition. Second, four ABS wheel speed sensors sampling at 100 Hz with tooth-count resolution of 48 or 96 per revolution, yielding angular resolution of 3.75–7.5 degrees. Third, the vehicle's inertial measurement unit (IMU) providing lateral acceleration, longitudinal acceleration, and yaw rate at 100-200 Hz, available via the ESC module.
All three data streams are accessible through the vehicle's CAN bus or Ethernet backbone. The processing can run on the existing infotainment system-on-chip (e.g., Qualcomm Snapdragon SA8295P, NVIDIA Orin) with less than 5% of available compute capacity.
2. Acoustic Feature Extraction
Audio from the cabin microphones is processed in 250 ms frames with 50% overlap. Each frame undergoes the following pipeline: bandpass filtering from 200 Hz to 4000 Hz to isolate the tire-road interaction band while rejecting powertrain noise (below 200 Hz at highway speeds) and wind noise (broadband but dominant above 4 kHz); computation of a 128-bin log-mel spectrogram using a 2048-point FFT with Hann windowing; extraction of the spectral centroid, spectral spread, spectral rolloff (95th percentile), and spectral flux across consecutive frames; and computation of a 20-coefficient MFCC (mel-frequency cepstral coefficient) vector to capture spectral envelope shape.
For vehicles equipped with ANC reference microphones (typically mounted in the wheel wells or on the suspension struts), these closer-to-source signals bypass much of the cabin transfer function and provide higher signal-to-noise ratio for tire-road noise. When available, the system extracts features from both cabin and reference microphones and concatenates them.
Speed normalization is critical. Tire-road noise intensity scales approximately as the 30th power of the logarithm of speed (the Sandberg speed exponent, typically 25-35 for passenger tires). The feature extractor normalizes all spectral features to a reference speed of 80 km/h using the vehicle's current GPS or wheel-speed-derived velocity.
3. Micro-Slip Feature Extraction
The four wheel speed signals are processed to extract micro-slip features that are informative about the tire's operating point on the friction curve. For each 250 ms window (synchronized with the acoustic frames), the system computes: the longitudinal slip ratio for each wheel as κ = (v_wheel − v_vehicle) / max(v_wheel, v_vehicle), where v_vehicle is the mean of the non-driven wheels or the IMU-integrated velocity; the standard deviation of κ over the window; the power spectral density of κ in the 1-50 Hz band; and the cross-correlation between left and right wheel slip ratios (high correlation indicates road surface effects; low correlation indicates vehicle dynamics effects).
Lateral dynamics features are extracted simultaneously: the ratio of measured lateral acceleration to estimated lateral acceleration from yaw rate and speed (the understeer gradient proxy), and the rear-to-front slip angle ratio estimated from wheel speed differences during cornering.
The combined feature vector per 250 ms frame contains approximately 180 features: 128 from acoustics and 52 from vehicle dynamics.
4. Self-Supervised Contrastive Learning Framework
The key technical contribution of this disclosure is a self-supervised learning approach that trains a friction estimation model without any labeled friction ground truth. The self-supervision signal comes from a spatiotemporal consistency constraint.
Consider two vehicles, A and B, that traverse the same 10-meter road segment within a 15-minute window. If neither vehicle has significantly altered the road surface (no fresh oil spill, no new precipitation), the true friction coefficient of that segment is the same for both vehicles. However, the raw sensor signals differ because the vehicles have different masses, tire compounds, suspension geometries, cabin acoustics, and speeds. A model that correctly extracts friction from these heterogeneous signals should produce the same estimate for both vehicles.
This constraint is formalized as a contrastive learning objective. Let f(θ) be a neural network (architecture: 1D temporal convolutional encoder with 6 layers, 64/128/256 channels, kernel size 7, followed by a 2-layer MLP projection head) that maps the 180-dimensional feature vector to a 32-dimensional friction embedding z. The training loss has two components:
- Positive pairs: Feature vectors from different vehicles on the same road segment within the temporal coherence window. The cosine similarity of their embeddings should be high: L_pos = −log(exp(sim(z_A, z_B) / τ) / Σ_neg exp(sim(z_A, z_neg) / τ)), where τ = 0.07 is a temperature parameter.
- Negative pairs: Feature vectors from the same vehicle on road segments known to have different friction (e.g., a dry highway segment versus a bridge deck during rain, identified by large embedding distance in the current model). Hard negative mining selects informative negatives from a memory bank of 65,536 embeddings.
A linear regression head maps the learned embedding to a scalar friction coefficient μ ∈ [0.0, 1.2]. This head is calibrated using sparse ground truth from three sources: municipal friction surveys conducted per ASTM E274 (available for approximately 15% of highway lane-miles annually), weather station data correlated with known friction-temperature-precipitation curves from Hippi and Kangas (TRR, 2015), and ABS/ESC activation events that provide instantaneous friction estimates at the moment of wheel lock-up or stability intervention.
5. Fleet Aggregation and Map Generation
Each participating vehicle transmits compressed friction estimates (road segment ID, timestamp, estimated μ, confidence score, vehicle class hash) to a cloud aggregation service at approximately 50 bytes per 10-meter segment. At 100 km/h, this produces roughly 280 estimates per minute per vehicle, totaling approximately 14 KB/min of uplink bandwidth.
The aggregation service maintains a spatiotemporal friction map indexed by H3 hexagonal grid cells at resolution 15 (approximately 0.9 m², aggregated to 10 m road segments using a map-matched road graph). For each road segment, the service computes a Bayesian posterior friction estimate using a hierarchical model that accounts for: per-vehicle calibration offsets (learned online), tire degradation drift (modeled as a slow random walk), weather-dependent temporal dynamics (friction drops rapidly with onset of rain, recovers slowly as road dries), and spatial autocorrelation (adjacent segments on the same road surface type have correlated friction).
The posterior μ estimate and its 95% credible interval are served via a tile-based REST API compatible with standard mapping SDKs. Latency from vehicle measurement to map update: target 30-90 seconds. Update frequency: continuous, with each road segment's estimate refreshed whenever a participating vehicle traverses it.
6. Privacy Architecture
Individual vehicle trajectories are never transmitted or stored. The on-vehicle processing module computes friction estimates locally and transmits only the road segment ID (derived from GPS position snapped to the road graph), the friction estimate, and a one-way hash of the vehicle class (sedan/SUV/truck, tire size category). No VIN, no precise location trace, no acoustic recordings leave the vehicle. The aggregation service operates on anonymized, segment-level data.
Differential privacy is applied at the aggregation layer: Laplacian noise with ε = 1.0 is added to per-segment vehicle counts before any analytics are computed, ensuring that the presence or absence of a single vehicle's data cannot be determined from the published map.
7. Applications
- Autonomous vehicle motion planning: Self-driving systems currently assume a conservative friction coefficient (typically μ = 0.3 for wet roads) because they cannot measure it directly. Real-time friction maps enable speed and braking profiles matched to actual road conditions, reducing unnecessary deceleration events by an estimated 15-30% while maintaining safety margins.
- ADAS forward collision warning tuning: Adaptive cruise control and automatic emergency braking can adjust their time-to-collision thresholds based on the friction coefficient of the road ahead (obtained from the map, not just the road under the vehicle), providing earlier warnings on icy surfaces and reduced false alarms on dry pavement.
- DOT pavement maintenance prioritization: Continuous friction maps identify segments with declining friction trends (polishing, rutting, bleeding) months before they would be caught by biennial skid trailer surveys, enabling condition-based maintenance scheduling.
- Insurance telematics: Usage-based insurance programs can adjust risk scores based on the friction conditions a driver routinely encounters, distinguishing drivers who commute on well-maintained highways from those navigating unpaved or poorly maintained roads.
- Navigation rerouting: When friction drops below a configurable threshold on a route (bridge deck ice, fresh oil spill), the navigation system can suggest alternate routes through segments with higher measured friction.
8. Figures Description
- Figure 1: System architecture showing on-vehicle sensor inputs (cabin microphones, wheel speed sensors, IMU), edge processing pipeline, compressed telemetry uplink, cloud aggregation service, and downstream consumer APIs.
- Figure 2: Log-mel spectrograms of tire-road acoustic emission at 80 km/h for four road surface conditions: dry dense-graded asphalt (DGAC), wet DGAC, dry polished concrete, and ice-covered asphalt, showing distinctive spectral shapes in the 200-4000 Hz band.
- Figure 3: Contrastive learning training scheme: positive pairs formed from different vehicles on the same road segment, negative pairs from segments with known friction differences, and the learned embedding space with friction coefficient as a smooth gradient.
- Figure 4: Example real-time friction map of a metropolitan freeway network, showing segment-level friction coefficients color-coded from red (μ < 0.3) through yellow (μ = 0.5) to green (μ > 0.7), with bridge decks and shaded segments showing lower friction during a morning frost event.
Claims
- A system for estimating road surface friction coefficients, comprising: one or more vehicle cabin microphones capturing tire-road acoustic emission; wheel speed sensors on each wheel of the vehicle; an on-vehicle processor that extracts acoustic spectral features and wheel speed micro-slip features from said sensors during normal driving without requiring braking or high-lateral-acceleration events; and a trained neural network model that maps said features to an estimated friction coefficient for the road segment currently being traversed.
- The system of claim 1, wherein the acoustic spectral features comprise log-mel spectrograms, spectral centroid, spectral spread, spectral rolloff, spectral flux, and mel-frequency cepstral coefficients computed from audio bandpass-filtered to the 200-4000 Hz tire-road interaction band.
- The system of claim 1, wherein the micro-slip features comprise per-wheel longitudinal slip ratio statistics, power spectral density of slip ratio fluctuations in the 1-50 Hz band, and cross-correlation between left and right wheel slip ratios.
- The system of claim 1, wherein the neural network model is trained using a self-supervised contrastive learning objective that treats feature vectors from different vehicles traversing the same road segment within a temporal coherence window as positive pairs and feature vectors from road segments with known friction differences as negative pairs.
- The system of claim 4, wherein the temporal coherence window is configurable and defaults to 15 minutes, and positive pair formation requires that no precipitation state change has occurred between the two vehicle traversals as determined by rain sensor data or weather API lookup.
- A method for generating real-time crowdsourced road friction maps, comprising: collecting friction coefficient estimates from a fleet of participating vehicles, each estimate associated with a map-matched road segment identifier and timestamp; aggregating estimates for each road segment using a Bayesian hierarchical model that accounts for per-vehicle calibration offsets, tire degradation drift, weather-dependent temporal dynamics, and spatial autocorrelation; and serving the posterior friction estimates and credible intervals via a tile-based API to navigation systems and autonomous driving stacks.
- The method of claim 6, wherein individual vehicle trajectories are never transmitted, and differential privacy with configurable epsilon is applied to per-segment vehicle counts at the aggregation layer.
- The method of claim 6, wherein the Bayesian hierarchical model incorporates weather station precipitation and temperature data as covariates to improve temporal interpolation between vehicle traversals on each road segment.
- The system of claim 1, wherein for vehicles equipped with active noise cancellation reference microphones mounted in wheel wells or on suspension components, features are extracted from both cabin and reference microphones and concatenated to improve signal-to-noise ratio for tire-road acoustic characterization.
- A method for calibrating a self-supervised friction estimation model, comprising: pre-training a feature encoder using the contrastive objective of claim 4; fitting a linear regression head from the learned embedding to friction coefficient using sparse ground truth obtained from municipal ASTM E274 skid trailer surveys, weather-correlated friction lookup tables, and ABS/ESC activation events that provide instantaneous friction estimates at the point of wheel lockup or stability intervention; and periodically updating the linear head as new sparse ground truth becomes available without retraining the encoder.
Implementation Notes
The acoustic feature extraction and micro-slip analysis pipeline consumes approximately 150 MFLOPS, well within the capacity of any automotive-grade SoC deployed since 2020. The contrastive encoder model has approximately 1.2 million parameters (4.8 MB in FP32, 1.2 MB quantized to INT8), small enough for on-device inference on infotainment processors without impacting other workloads. Training the contrastive encoder requires fleet-scale data (target: 10 million positive pairs from 50,000+ vehicles over 30 days) and is performed in the cloud, with model updates distributed to vehicles via OTA firmware updates at approximately monthly cadence.
The minimum viable deployment requires vehicles with at least one cabin microphone and ABS wheel speed sensors. This covers approximately 85% of new vehicles sold in the US since 2020. Vehicles with road noise cancellation systems (estimated 25% of MY2025 production) provide higher-quality acoustic data due to reference microphone placement closer to the tire-road interface.
Prior Art References
- FHWA — Wet pavement crash statistics: ~860,000 crashes, 4,700 fatalities annually
- FHWA Road Weather — Weather-related crash data including black ice fatalities
- US8626454B2 (Continental, 2014) — Friction estimation using ABS activation events
- Acosta et al., MSSP 2020 — Survey of tire-road friction estimation methods
- ASTM E274 — Standard test method for skid resistance using locked-wheel trailer
- Roychowdhury et al., IEEE T-ITS 2021 — CNN-based road surface classification from cameras
- Sandberg & Ejsmont, Applied Acoustics 2018 — Tire-road noise correlation with macrotexture (R² = 0.82)
- Rado, TRR 1996 — Macrotexture-friction correlation at highway speeds
- Sandberg speed exponent — Tire-road noise speed dependence (~30th power law)
- Cai et al., 2023 — Cooperative road friction estimation from wheel speed data
- Hippi & Kangas, TRR 2015 — Friction-temperature-precipitation relationship curves
- H3 Hexagonal Grid — Uber spatial indexing system for geospatial tiling
- Bose QuietComfort RNC — Road noise cancellation with reference microphones
- Gustafsson et al., Applied Sciences 2017 — Road friction virtual sensing review
- Grip Tester — Continuous friction measurement device