This repository contains the code for the paper "A Spatio-Temporal Point Process for Fine-Grained Modeling of Reading Behavior", to be presented at ACL 2025.
Below is an animation of our Hawkes process model evaluated on a reading sequence that was held out from the training and validation sets:
To run the experiments, we recommend setting up a Python environment with the required dependencies. You can do this using conda:
# Create a new environment with Python 3.12.7
conda create -n reading-models python=3.12.7 -y
conda activate reading-models
# Install dependencies
pip install -r requirements.txt
We recommend starting with the model cards defined in scripts/model_cards.py:
# List all predefined model variants from the paper, grouped in saccade and duration models, on filtered or full (raw) scanpaths
python scripts/launcher_local.py -h
# example: Train & evaluate the Hawkes process with a spatial shift, reader-specific effects, and word length as a predictor on the raw scanpath dataset
python scripts/launcher_local.py --rme-css-len-freq-raw
# example: Train & evaluate the duration model with word surprisal and a convolved past spillover effect
python scripts/launcher_local.py --dur-rme-ws-raw
Each card exposes three boolean flags you can override directly in model_cards.py:
- training=true|false – perform parameter learning
- testing=true|false – evaluate on the held-out split
- subset=true|false – restrict to ~2k events for quick smoke tests
More granular settings (learning rate, kernel size, etc.) live in the dataclass config.py::RunConfig and can be overridden via environment variables or direct YAML edits.
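As an illustration of the environment-variable override pattern, here is a minimal sketch of a config dataclass. The field and variable names below are hypothetical, chosen only to mirror the settings mentioned above; see config.py::RunConfig for the actual fields.

```python
import os
from dataclasses import dataclass


@dataclass
class RunConfig:
    # Hypothetical fields for illustration; the real ones live in config.py::RunConfig.
    learning_rate: float = 1e-3
    kernel_size: int = 5

    @classmethod
    def from_env(cls) -> "RunConfig":
        # Environment variables, when set, override the dataclass defaults.
        return cls(
            learning_rate=float(os.environ.get("LEARNING_RATE", cls.learning_rate)),
            kernel_size=int(os.environ.get("KERNEL_SIZE", cls.kernel_size)),
        )


cfg = RunConfig.from_env()
```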
Our experiments are executed on ETH Zürich's Euler high-performance computing (HPC) cluster. We define the search grid in scripts/experiments.py, then launch multiple jobs:
python scripts/cluster_launcher_mjobs.py \
--model <model_name> \
--output-dir <output_directory> \
--partition <partition_name> \
--account <account_name> \
--cpus <num_cpus> \
--gpus <num_gpus> \
--mem <memory_in_GB> \
--time <time_limit>
All experiments are saved under the cluster_runs/ directory, which is organized as follows:
cluster_runs/
├── duration/
│ └── <model_name>_<timestamp>/
│ ├── [folder run 1]
│ ├── [folder run 2]
│ └── ...
├── saccade/
│ └── <model_name>_<timestamp>/
│ ├── [folder run 1]
│ ├── [folder run 2]
│ └── ...
After running the experiments, we select the best model in each experiment folder based on its performance on the validation set.
To do this, run:
- For duration modeling:
python scripts/select_best_val_model.py --duration
- For saccade modeling:
python scripts/select_best_val_model.py --saccade
This script saves the best-performing model for each experiment in the best_model/ directory.
Once the best models are saved, evaluate them on the test set by running:
python scripts/run_test_eval_global.py --root-dir <path-to-dir>
This script:
- Loads every best-model checkpoint,
- Evaluates it on the test set, and
- For the saccade modeling task, creates an animation of a held-out reading session (reader 70, text 3; see the example animation above).
The data is available at: https://polybox.ethz.ch/index.php/s/ncbLm6ZK9RXiXLF
- dataset_cached/: the processed, cached dataset. There are several versions, but they differ only in the scale of the temporal and spatial axes (e.g., seconds vs. milliseconds).
- MECO/tabular_en/: .csv files, each representing a reading session for a given reader and text.
- texts_en/sentences_by_char.csv: a table with the bounding boxes for each text.
- texts_en_images_char_level_surp/ and texts_en_images_word_surp/: plots by text of the character-level and word-level surprisal, respectively.
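To get a feel for the tabular session files, here is a minimal loader sketch using only the standard library. The column names (x, y, duration) are hypothetical assumptions; inspect the actual .csv headers under MECO/tabular_en/ for the real schema.

```python
import csv
from pathlib import Path


def load_session(csv_path: Path) -> list[dict]:
    """Load one reading session from a MECO-style .csv file.

    The column names below are hypothetical; check the actual file headers.
    """
    with open(csv_path, newline="") as f:
        rows = []
        for row in csv.DictReader(f):
            rows.append({
                "x": float(row["x"]),                # horizontal fixation position
                "y": float(row["y"]),                # vertical fixation position
                "duration": float(row["duration"]),  # fixation duration
            })
    return rows
```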
If you use this work, please cite it as:
@inproceedings{re2025spatiotemporal,
title = {A Spatio-Temporal Point Process for Fine-Grained Modeling of Reading Behavior},
author = {Re, Francesco Ignazio and Opedal, Andreas and Manaiev, Glib and Giulianelli, Mario and Cotterell, Ryan},
booktitle = {Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL)},
address = {Vienna, Austria},
month = jul,
year = {2025},
url = {https://arxiv.org/abs/2506.19999}
}