Skip to content

Accompany material for the paper `Identification of Multiple Logical Interpretations in Counter-Arguments`

License

Notifications You must be signed in to change notification settings

cl-tohoku/ca-multi-ptn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CALSA+ dataset and other additional materials for the paper: Identification of Multiple Logical Interpretations in Counter-Arguments

Contents

dataset folder contains the CALSA+ dataset and the related files

  • processed_merged_results_readable.csv: Raw annotation results of three annotators at the predicate-level
  • all.jsonl: processed predicate-level results, the answer is determined by taking as many YES as possible (i.e., answer = YES if any of the annotators select YES; otherwise, NO)
  • calsaplus_dataset.jsonl: The CALSA+ dataset where each CA has multiple logical interpretations obtained by aggregating the predicate-level results in all.jsonl
  • original_calsa_testset.json: The original CALSA test set where CALSA+ is created from.

grpo folder contains all the data and scripts for RLVR experiments

  • For training models with RLVR, run any of the scripts in grpo/src/scripts/train/, each one is corresponded to a base model
    • e.g., cd <where-you-clone-this-repo> && source grpo/src/scripts/train/qwen25-7b-instruct_deepspeed_zero2.sh
  • For runing inferences, run any of the scripts in grpo/src/scripts/inference/, each one is corresponded to a base model
    • e.g., cd <where-you-clone-this-repo> && source grpo/src/scripts/inference/qwen25-7b-instruct_deepspeed_zero2.sh
  • For running baseline experiments to compare with RLVR, run the corresponding script in grpo/src/scripts/inference/baselines/

sft folder contains all the data and scripts for SFT experiments

  • For training models with SFT, run any of the scripts in sft/src/scripts/train/, each one is corresponded to a base model
    • e.g., cd <where-you-clone-this-repo> && source sft/src/scripts/train/qwen25-7b-instruct_deepspeed_zero2.sh
  • For runing inferences, run any of the scripts in sft/src/scripts/inference/, each one is corresponded to a base model
    • e.g., cd <where-you-clone-this-repo> && source sft/src/scripts/inference/qwen25-7b-instruct_deepspeed_zero2.sh

prompt_enginneering folder contains all the data and scripts for prompting OpenAI models

  • Usage: python prompt_engineering/src/gpts.py

For evaluting any of the above experiments, run python evaluate_results.py with the corresponding file path where you saved the results

License

The content of this project itself is licensed under the MIT license, and the dataset provided in the folder dataset is licensed under the CC BY 4.0.

About

Accompany material for the paper `Identification of Multiple Logical Interpretations in Counter-Arguments`

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published