Add pipeline tag, links to paper and code repository

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +42 -16
README.md CHANGED
@@ -1,40 +1,54 @@
1
  ---
 
2
  library_name: transformers
3
  license: mit
4
- base_model: roberta-base
5
- tags:
6
- - generated_from_trainer
7
  metrics:
8
  - accuracy
 
 
9
  model-index:
10
  - name: roberta-sentence-classifier
11
  results: []
12
  ---
13
 
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
-
17
  # roberta-sentence-classifier
18
 
19
- This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
20
- It achieves the following results on the evaluation set:
21
- - Loss: 0.6266
22
- - Accuracy: 0.7990
23
- - Macro F1: 0.7614
24
- - Micro F1: 0.7990
25
- - Qwk: 0.6588
26
 
27
  ## Model description
28
 
29
- More information needed
 
 
 
 
 
 
 
 
 
 
30
 
31
  ## Intended uses & limitations
32
 
33
- More information needed
34
 
35
  ## Training and evaluation data
36
 
37
- More information needed
 
 
 
 
 
 
 
38
 
39
  ## Training procedure
40
 
@@ -66,3 +80,15 @@ The following hyperparameters were used during training:
66
  - Pytorch 2.9.0+cu126
67
  - Datasets 4.0.0
68
  - Tokenizers 0.22.1
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model: roberta-base
3
  library_name: transformers
4
  license: mit
5
+ pipeline_tag: text-classification
 
 
6
  metrics:
7
  - accuracy
8
+ tags:
9
+ - generated_from_trainer
10
  model-index:
11
  - name: roberta-sentence-classifier
12
  results: []
13
  ---
14
 
 
 
 
15
  # roberta-sentence-classifier
16
 
17
+ This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) presented in the paper **[Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction](https://huggingface.co/papers/2606.28186)**.
18
+
19
+ It serves as the sentence-level cognitive episode tagger in the **Epi2Diff** (Episode to Difficulty) framework. It maps Large Reasoning Model (LRM) reasoning traces into cognitively grounded episode sequences to support interpretable modeling of human item difficulty.
20
+
21
+ - **Repository:** [c-steve-wang/Epi2Diff](https://github.com/c-steve-wang/Epi2Diff)
22
+ - **Paper:** [Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction](https://huggingface.co/papers/2606.28186)
 
23
 
24
  ## Model description
25
 
26
+ The model classifies sentence-level reasoning units into 8 problem-solving cognitive episode states:
27
+ - `Read`
28
+ - `Analyze`
29
+ - `Plan`
30
+ - `Implement`
31
+ - `Explore`
32
+ - `Verify`
33
+ - `Monitor`
34
+ - `Answer`
35
+
36
+ These classified sequences are then used by the Epi2Diff framework to extract compact episode-dynamic process features for downstream item difficulty prediction.
37
 
38
  ## Intended uses & limitations
39
 
40
+ You can use this model to segment and tag raw reasoning traces into functional problem-solving states to evaluate reasoning behaviors, perform interpretability studies, or support downstream educational measurement tasks.
41
 
42
  ## Training and evaluation data
43
 
44
+ The model was fine-tuned on annotated reasoning trace sentences derived from datasets such as SAT Math, SAT Reading & Writing, Cambridge, and USMLE.
45
+
46
+ It achieves the following results on the evaluation set:
47
+ - Loss: 0.6266
48
+ - Accuracy: 0.7990
49
+ - Macro F1: 0.7614
50
+ - Micro F1: 0.7990
51
+ - Qwk: 0.6588
52
 
53
  ## Training procedure
54
 
 
80
  - Pytorch 2.9.0+cu126
81
  - Datasets 4.0.0
82
  - Tokenizers 0.22.1
83
+
84
+ ## Citation
85
+
86
+ If you use this model, please cite:
87
+
88
+ ```bibtex
89
+ @misc{wang2026epi2diff,
90
+ title = {Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction},
91
+ author = {Wang, Chenguang and Li, Ming and Zeng, Xinyue and Li, Zhuochun and Jiao, Hong and Zhou, Tianyi and Zhou, Dawei},
92
+ year = {2026}
93
+ }
94
+ ```
Free AI Image Generator No sign-up. Instant results. Open Now