289. Pervmom Jun 2026
PervMom injects into video models by turning birth–death pairs from a Vietoris–Rips filtration into learnable momentum vectors . The resulting representation captures how long spatio‑temporal patterns persist, leading to consistent accuracy improvements (≈ 2–4 % absolute) on major action‑recognition benchmarks, with only modest computational overhead.