CALSA+ dataset and other additional materials for the paper: Identification of Multiple Logical Interpretations in Counter-Arguments
`dataset` folder contains the CALSA+ dataset and the related files:

- `processed_merged_results_readable.csv`: Raw annotation results of the three annotators at the predicate level
- `all.jsonl`: Processed predicate-level results; the answer is determined by taking as many `YES` as possible (i.e., answer = `YES` if any of the annotators selects `YES`; otherwise, `NO`)
- `calsaplus_dataset.jsonl`: The CALSA+ dataset, where each CA has multiple logical interpretations, obtained by aggregating the predicate-level results in `all.jsonl`
  - The `ptn` numbers are consistent with the patterns of logical structure defined in the original CALSA paper: https://aclanthology.org/2024.findings-emnlp.661/
- `original_calsa_testset.json`: The original CALSA test set from which CALSA+ was created.
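As a quick sanity check, the JSONL files above can be loaded line by line with the standard library; a minimal sketch (the key `ptn` in the usage comment is taken from the description above, but the full schema is not guaranteed):

```python
import json

def load_jsonl(path):
    """Read a JSONL file (one JSON object per line) into a list of dicts."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records

# Example usage (path from this repo; exact record schema is an assumption):
# data = load_jsonl("dataset/calsaplus_dataset.jsonl")
# patterns = {record.get("ptn") for record in data}
```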
`grpo` folder contains all the data and scripts for RLVR experiments
- For training models with RLVR, run any of the scripts in `grpo/src/scripts/train/`; each one corresponds to a base model, e.g., `cd <where-you-clone-this-repo> && source grpo/src/scripts/train/qwen25-7b-instruct_deepspeed_zero2.sh`
- For running inference, run any of the scripts in `grpo/src/scripts/inference/`; each one corresponds to a base model, e.g., `cd <where-you-clone-this-repo> && source grpo/src/scripts/inference/qwen25-7b-instruct_deepspeed_zero2.sh`
- For running baseline experiments to compare with RLVR, run the corresponding script in `grpo/src/scripts/inference/baselines/`
`sft` folder contains all the data and scripts for SFT experiments
- For training models with SFT, run any of the scripts in `sft/src/scripts/train/`; each one corresponds to a base model, e.g., `cd <where-you-clone-this-repo> && source sft/src/scripts/train/qwen25-7b-instruct_deepspeed_zero2.sh`
- For running inference, run any of the scripts in `sft/src/scripts/inference/`; each one corresponds to a base model, e.g., `cd <where-you-clone-this-repo> && source sft/src/scripts/inference/qwen25-7b-instruct_deepspeed_zero2.sh`
`prompt_engineering` folder contains all the data and scripts for prompting OpenAI models
- Usage: `python prompt_engineering/src/gpts.py`
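For orientation, a prompting script of this kind typically assembles a chat-style request per counter-argument before sending it to the API; a minimal, self-contained sketch (the prompt wording and the YES/NO framing here are illustrative and not taken from `gpts.py`):

```python
def build_messages(counter_argument, candidate_interpretation):
    """Build a chat-completion style message list asking whether a candidate
    logical interpretation fits a counter-argument (illustrative only)."""
    system = (
        "You judge whether a logical interpretation fits a counter-argument. "
        "Answer YES or NO."
    )
    user = (
        f"Counter-argument:\n{counter_argument}\n\n"
        f"Candidate interpretation:\n{candidate_interpretation}\n\n"
        "Does the interpretation hold? Answer YES or NO."
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": user},
    ]

# The resulting list would then be passed to an OpenAI chat-completion call.
```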
For evaluating any of the above experiments, run `python evaluate_results.py` with the corresponding file path where you saved the results
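Conceptually, such an evaluation amounts to comparing predicted YES/NO answers against gold labels in a results file; a hypothetical sketch (the key names `prediction` and `answer` are assumptions — the actual logic lives in `evaluate_results.py`):

```python
import json

def accuracy_from_results(path, pred_key="prediction", gold_key="answer"):
    """Compute simple accuracy from a JSONL results file where each line
    holds a predicted and a gold YES/NO label (key names are assumed)."""
    correct = total = 0
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            total += 1
            if str(record[pred_key]).strip().upper() == str(record[gold_key]).strip().upper():
                correct += 1
    return correct / total if total else 0.0
```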
The content of this project itself is licensed under the MIT license, and the dataset provided in the `dataset` folder is licensed under CC BY 4.0.