+ abstract: 'Latent categorical variables are frequently found in deep learning architectures. They can model actions in discrete reinforcement-learning environments, represent categories in latent-variable models, or express relations in graph neural networks. Despite their widespread use, their discrete nature poses significant challenges to gradient-descent learning algorithms. While a substantial body of work has offered improved gradient estimation techniques, we take a complementary approach. Specifically, we: 1) revisit the ubiquitous softmax function and demonstrate its limitations from an information-geometric perspective; 2) replace the softmax with the catnat function, a function composed of a sequence of hierarchical binary splits; we prove that this choice offers significant advantages to gradient descent due to the resulting diagonal Fisher Information Matrix. A rich set of experiments - including graph structure learning, variational autoencoders, and reinforcement learning - empirically shows that the proposed function improves learning efficiency and yields models with consistently higher test performance. Catnat is simple to implement and seamlessly integrates into existing codebases. Moreover, it remains compatible with standard training stabilization techniques and, as such, offers a better alternative to the softmax function.'
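The abstract describes catnat only as "a sequence of hierarchical binary splits"; the exact tree structure used in the paper is not given here. The sketch below is a rough illustration under that assumption, not the paper's definition: it parameterizes a K-class distribution with K-1 sigmoid-gated binary splits applied in sequence, each gate deciding how much of the remaining probability mass stays at the current class. The function names (`catnat_chain`, `sigmoid`, `softmax`) are hypothetical helpers introduced for this example.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(logits):
    # Standard softmax over the last axis, shown only for comparison.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def catnat_chain(logits):
    """Map K-1 unconstrained logits to a K-class probability vector
    through a sequence of binary splits (illustrative assumption:
    a simple chain of splits, i.e. a stick-breaking construction).
    """
    gates = sigmoid(logits)          # per-split probability of stopping here
    probs = []
    remaining = 1.0
    for g in gates:
        probs.append(remaining * g)  # mass assigned to the current class
        remaining *= (1.0 - g)       # mass passed on to the remaining splits
    probs.append(remaining)          # last class receives the leftover mass
    return np.array(probs)

# Example: 4 classes require 3 binary splits.
p = catnat_chain(np.array([0.2, -1.0, 0.5]))
print(p, p.sum())  # a valid probability vector summing to 1
```

Because each class probability depends on its own gate (and the gates above it in the hierarchy) rather than on a shared normalizer over all logits, gradients with respect to different gates decouple, which is consistent with the diagonal Fisher Information Matrix property claimed in the abstract.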