Edge (u,v) builds a message from partner state, features & Δt; only the two touched nodes update their memory.
Memories serve directly as node embeddings, fed into decoder with no extra computation at query time.
Edge (u,v,t) is simply appended to G(t) — no computation performed at event time.
GNN runs on full G(t) to produce node embeddings; must re-run for every new query — expensive at inference.
Snapshot (DTDG) and event-based (CTDG) models developed in isolation: limited cross-comparison and no unified evaluation.
Snapshot-based models are ≥ 10× faster at inference than most event-based models.
With UTG training, snapshot-based models match TGN & GraphMixer even on event-based (CTDG) datasets.
NAT & DyGFormer's edge comes from joint neighbourhood features, not from the event-based format: these can be added to snapshot models too.
| Method |
Human readability
|
LLM-native transport
|
Semantic editing
|
Acoustic interpretability
|
Training-free
|
Generative decoding
|
Bandwidth efficiency
|
|
|---|---|---|---|---|---|---|---|---|
| Lossless codec (FLAC, WAV) | ||||||||
| Handcrafted descriptors (MFCC, spectral centroid) | ||||||||
| Neural codec (EnCodec, SoundStream) | ||||||||
| Audio-language tokenizers (AudioLM, TASTE) | ||||||||
| Unconstrained text caption | ||||||||
| LAC (this paper) |
| Dataset |
# Whistles
|
Voc. hours
|
Time span (yrs)
|
Stable pod (# indiv.)
|
Setting
|
Seq. context
|
Open
|
|---|---|---|---|---|---|---|---|
| OpenWhistle Pretraining | ~180,000* | 114.3 | 5.0 | (5) | Semi-nat. | ||
| OpenWhistle Expert subset | 8,354 | 1.9 | 0.42 | (5) | Semi-nat. | ||
| DOLPHINFREE | 4,600 | 7.3 | 2.0 | Wild | |||
| Di Nardo et al., 2025 | 3,111 | 0.6 | 0.003 | (7) | Captive | ||
| Watkins MMSD | 566 | N/R | 70+ | Wild | |||
| Korkmaz et al., 2023 | ~29,000* | 6.8 | 0.07 | Semi-nat. | |||
| Sicily Strait PAM | 14,048 | N/R | 1.2 | Wild | |||
| DCLDE 2011 | 6,011 | 0.7 | 4.0 | Wild | |||
| SDWD | N/R | N/R | 43+ | (293) | Wild (C&R) |