Cross-City Analysis

Visualizations of how self-supervised learning (SSL) backbones generalize across cities. Data is sourced live from the experiment database.

Cross-City Transfer Matrix

Best experiment per training city, showing PDM scores on each evaluation city. Diagonal entries (marked) indicate in-distribution performance.

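| Train City         | Boston | Las Vegas | Pittsburgh | Singapore | Avg   |
|--------------------|--------|-----------|------------|-----------|-------|
| all (A3)           | 84.7%  | 91.5%     | 80.3%      | 80.2%     | 85.4% |
| boston (P1-B1)     | 79.7%* | 77.5%     | 61.2%      | 52.8%     | 71.0% |
| vegas (P1-B2)      | 69.1%  | 92.0%*    | 56.9%      | 52.5%     | 71.7% |
| pittsburgh (P1-B3) | 73.2%  | 78.7%     | 73.1%*     | 51.9%     | 71.7% |
| singapore (P1-B4)  | 54.7%  | 66.1%     | 53.7%      | 62.7%*    | 59.6% |

* In-distribution entry (training city equals evaluation city).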
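
One way to read the matrix is to compare each single-city model's in-distribution score with its mean score on the other three cities. The sketch below does that with the values copied from the table above. The names `TRANSFER`, `EVAL_CITIES`, and `transfer_gap` are illustrative and not part of the dashboard, and the unweighted mean over the other cities is an assumption (the dashboard's own Avg column is evidently not a plain mean of the four city scores).

```python
# PDM scores (%) per evaluation city, copied from the transfer matrix.
# Column order follows EVAL_CITIES. Names here are hypothetical helpers.
EVAL_CITIES = ["boston", "vegas", "pittsburgh", "singapore"]
TRANSFER = {
    "boston":     [79.7, 77.5, 61.2, 52.8],
    "vegas":      [69.1, 92.0, 56.9, 52.5],
    "pittsburgh": [73.2, 78.7, 73.1, 51.9],
    "singapore":  [54.7, 66.1, 53.7, 62.7],
}


def transfer_gap(train_city):
    """Return (in-distribution score, mean score on other cities, gap)."""
    scores = TRANSFER[train_city]
    diag = EVAL_CITIES.index(train_city)
    in_dist = scores[diag]
    others = [s for j, s in enumerate(scores) if j != diag]
    out_mean = sum(others) / len(others)  # unweighted mean (assumption)
    return in_dist, out_mean, in_dist - out_mean


if __name__ == "__main__":
    for city in EVAL_CITIES:
        in_dist, out_mean, gap = transfer_gap(city)
        print(f"{city:<11} in={in_dist:5.1f}  out={out_mean:5.1f}  gap={gap:+5.1f}")
```

Under this measure the vegas model shows the largest gap (about 32.5 points) and the singapore model the smallest (about 4.5 points). The pittsburgh model scores higher on Las Vegas (78.7%) than on its own training city (73.1%).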

Backbone Comparison

Average PDM scores grouped by SSL backbone architecture.

| Backbone    | Experiments | Average PDM | Best PDM |
|-------------|-------------|-------------|----------|
| ResNet34    | 15          | 63.1%       | 85.4%    |
| (unlabeled) | 2           | 46.4%       | 66.9%    |
| ijepa       | 10          | 0.0%        | 0.0%     |
| dinov2      | 10          | 0.0%        | 0.0%     |
| mae         | 10          | 0.0%        | 0.0%     |
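
The grouping behind these figures can be approximated from the rows listed in the Sub-Metric Breakdown. The sketch below is only an illustration: `ROWS` and `summarize` are hypothetical names, and the `(unlabeled)` key stands in for rows whose backbone field is empty. Note that the ResNet34 card reports 15 experiments while only 14 ResNet34 rows appear in the breakdown, so the mean over the listed rows (about 67.6%) differs from the displayed 63.1%. One possible explanation, not confirmed by the source, is that the displayed average also counts an experiment without a score.

```python
from collections import defaultdict

# (experiment ID, backbone, Avg PDM in %) copied from the Sub-Metric Breakdown.
# An empty backbone string means the field is blank in the source.
ROWS = [
    ("A3", "ResNet34", 85.4), ("A3-b", "ResNet34", 79.2),
    ("A4", "ResNet34", 78.3), ("P1-B2", "ResNet34", 71.7),
    ("P1-B3", "ResNet34", 71.7), ("P1-B1", "ResNet34", 71.0),
    ("L1-B2", "ResNet34", 69.1), ("P1-B2-b", "ResNet34", 68.6),
    ("A2", "", 66.9), ("P1-B1-b", "ResNet34", 65.0),
    ("L1-B1", "ResNet34", 64.2), ("P1-B4", "ResNet34", 59.6),
    ("P1-B3-b", "ResNet34", 57.4), ("L1-B3", "ResNet34", 54.5),
    ("L1-B4", "ResNet34", 50.9), ("A1", "", 25.9),
]


def summarize(rows):
    """Group PDM scores by backbone and return count, mean, and best."""
    groups = defaultdict(list)
    for _exp_id, backbone, pdm in rows:
        groups[backbone or "(unlabeled)"].append(pdm)
    return {
        name: {"n": len(v), "avg": sum(v) / len(v), "best": max(v)}
        for name, v in groups.items()
    }


if __name__ == "__main__":
    for name, stats in summarize(ROWS).items():
        print(f"{name:<12} n={stats['n']:2d}  "
              f"avg={stats['avg']:.1f}%  best={stats['best']:.1f}%")
```

For the two rows without a backbone label (A1 and A2), this reproduces the displayed average of 46.4% and best of 66.9%.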

Sub-Metric Breakdown

Detailed scoring components across completed experiments. NC = No Collision, DAC = Drivable Area Compliance, EP = Ego Progress, TTC = Time to Collision, C = Comfort.

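| ID      | Backbone    | NC    | DAC   | EP    | TTC   | C     | Avg PDM |
|---------|-------------|-------|-------|-------|-------|-------|---------|
| A3      | ResNet34    | 0.979 | 0.929 | 0.861 | 0.969 | 0.942 | 85.4%   |
| A3-b    | ResNet34    | 0.975 | 0.855 | 0.857 | 0.967 | 0.910 | 79.2%   |
| A4      | ResNet34    | 0.967 | 0.869 | 0.847 | 0.957 | 0.895 | 78.3%   |
| P1-B2   | ResNet34    | 0.954 | 0.784 | 0.834 | 0.938 | 0.941 | 71.7%   |
| P1-B3   | ResNet34    | 0.952 | 0.809 | 0.847 | 0.937 | 0.904 | 71.7%   |
| P1-B1   | ResNet34    | 0.945 | 0.797 | 0.860 | 0.930 | 0.905 | 71.0%   |
| L1-B2   | ResNet34    | 0.938 | 0.776 | 0.840 | 0.926 | 0.885 | 69.1%   |
| P1-B2-b | ResNet34    | 0.946 | 0.753 | 0.830 | 0.928 | 0.933 | 68.6%   |
| A2      | (unlabeled) | 0.938 | 0.752 | 0.834 | 0.924 | 0.927 | 66.9%   |
| P1-B1-b | ResNet34    | 0.929 | 0.748 | 0.853 | 0.914 | 0.844 | 65.0%   |
| L1-B1   | ResNet34    | 0.927 | 0.738 | 0.856 | 0.911 | 0.838 | 64.2%   |
| P1-B4   | ResNet34    | 0.867 | 0.792 | 0.848 | 0.852 | 0.915 | 59.6%   |
| P1-B3-b | ResNet34    | 0.903 | 0.691 | 0.849 | 0.888 | 0.795 | 57.4%   |
| L1-B3   | ResNet34    | 0.895 | 0.663 | 0.859 | 0.870 | 0.845 | 54.5%   |
| L1-B4   | ResNet34    | 0.852 | 0.702 | 0.836 | 0.837 | 0.860 | 50.9%   |
| A1      | (unlabeled) | 0.720 | 0.538 | 0.754 | 0.710 | 0.814 | 25.9%   |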
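
To see which scoring component separates the experiments most, one simple option is to compare the min-to-max spread of each sub-metric across the rows above. This is a minimal sketch using the values copied from the table. `SUB_METRICS` and `metric_spread` are illustrative names, and using the range as the measure of variation is an assumption rather than something the dashboard computes.

```python
# Sub-metric values per experiment, copied from the breakdown table.
# Tuple order follows METRICS. Names here are hypothetical helpers.
METRICS = ["NC", "DAC", "EP", "TTC", "C"]
SUB_METRICS = {
    "A3":      (0.979, 0.929, 0.861, 0.969, 0.942),
    "A3-b":    (0.975, 0.855, 0.857, 0.967, 0.910),
    "A4":      (0.967, 0.869, 0.847, 0.957, 0.895),
    "P1-B2":   (0.954, 0.784, 0.834, 0.938, 0.941),
    "P1-B3":   (0.952, 0.809, 0.847, 0.937, 0.904),
    "P1-B1":   (0.945, 0.797, 0.860, 0.930, 0.905),
    "L1-B2":   (0.938, 0.776, 0.840, 0.926, 0.885),
    "P1-B2-b": (0.946, 0.753, 0.830, 0.928, 0.933),
    "A2":      (0.938, 0.752, 0.834, 0.924, 0.927),
    "P1-B1-b": (0.929, 0.748, 0.853, 0.914, 0.844),
    "L1-B1":   (0.927, 0.738, 0.856, 0.911, 0.838),
    "P1-B4":   (0.867, 0.792, 0.848, 0.852, 0.915),
    "P1-B3-b": (0.903, 0.691, 0.849, 0.888, 0.795),
    "L1-B3":   (0.895, 0.663, 0.859, 0.870, 0.845),
    "L1-B4":   (0.852, 0.702, 0.836, 0.837, 0.860),
    "A1":      (0.720, 0.538, 0.754, 0.710, 0.814),
}


def metric_spread(table):
    """Return {metric: (min, max, max - min)} across all experiments."""
    spread = {}
    for k, name in enumerate(METRICS):
        column = [row[k] for row in table.values()]
        spread[name] = (min(column), max(column), max(column) - min(column))
    return spread


if __name__ == "__main__":
    ranked = sorted(metric_spread(SUB_METRICS).items(),
                    key=lambda item: item[1][2], reverse=True)
    for name, (lo, hi, rng) in ranked:
        print(f"{name:<3} min={lo:.3f}  max={hi:.3f}  range={rng:.3f}")
```

On these rows DAC has the widest range (0.538 to 0.929) and EP the narrowest (0.754 to 0.861), with much of the spread driven by experiment A1.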