PervFormer May 2026

| Model | Something-Something V2 (Accuracy) | Kinetics-700 (FLOPs) | GPU Memory (128 frames) |
| :--- | :--- | :--- | :--- |
| TimeSformer | 62.5% | 1.9k G | 42 GB |
| VideoMAE | 70.8% | 2.1k G | OOM (>80 GB) |
| PervFormer | 74.2% | 980 G | 23 GB |

For years, the computer vision community has debated a fundamental trade-off: pervformer

Note: OOM = Out of Memory on 80GB A100.

Not only is PervFormer more accurate than VideoMAE on Sth-Sth V2 (a dataset that requires true temporal reasoning), it also uses roughly half the compute (980 G vs. 2.1k G). At 128 frames it needs 23 GB of GPU memory, about half of TimeSformer's 42 GB, while VideoMAE runs out of memory.

Why This Matters for Production

While academic benchmarks are nice, the real win for PervFormer is in edge deployment and real-time systems.
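The accuracy, compute, and memory comparison can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of any PervFormer tooling: the numbers are copied from the comparison table, "k G" is read as thousands of GFLOPs, the row without a model name is assumed to be PervFormer, and the helper names `compare` and `fits_budget` as well as the memory budgets are hypothetical.

```python
from typing import Dict, List, Optional

# Figures copied from the comparison table (assumption: "1.9k G" means 1900 GFLOPs).
# mem_gb is None where the table reports OOM on an 80 GB A100.
# Assumption: the table row with no model name is PervFormer.
RESULTS: Dict[str, Dict[str, Optional[float]]] = {
    "TimeSformer": {"acc": 62.5, "gflops": 1900.0, "mem_gb": 42.0},
    "VideoMAE": {"acc": 70.8, "gflops": 2100.0, "mem_gb": None},
    "PervFormer": {"acc": 74.2, "gflops": 980.0, "mem_gb": 23.0},
}


def compare(model: str, baseline: str) -> Dict[str, Optional[float]]:
    """Return accuracy gain (points) and compute/memory ratios of model vs. baseline."""
    m, b = RESULTS[model], RESULTS[baseline]
    mem_ratio = None
    if m["mem_gb"] is not None and b["mem_gb"] is not None:
        mem_ratio = round(m["mem_gb"] / b["mem_gb"], 2)
    return {
        "acc_gain": round(m["acc"] - b["acc"], 1),
        "compute_ratio": round(m["gflops"] / b["gflops"], 2),
        "mem_ratio": mem_ratio,  # None when either side ran out of memory
    }


def fits_budget(budget_gb: float) -> List[str]:
    """List models whose reported 128-frame memory fits a hypothetical GPU budget."""
    return [
        name
        for name, row in RESULTS.items()
        if row["mem_gb"] is not None and row["mem_gb"] <= budget_gb
    ]


if __name__ == "__main__":
    print(compare("PervFormer", "VideoMAE"))
    print(compare("PervFormer", "TimeSformer"))
    print(fits_budget(24.0))
```

Against VideoMAE the memory ratio is left undefined because the table only reports an out-of-memory result, so the "half the memory" figure is only computable against TimeSformer.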

