Nellyw888 commited on
Commit
da6a8a0
·
verified ·
1 Parent(s): da3ed43

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -16,6 +16,8 @@ base_model:
16
 
17
  For implementation details, visit our GitHub repository: [VeriReason](https://github.com/NellyW8/VeriReason)
18
 
 
 
19
  ## Update Log
20
  2025.05.17: Initial release of VeriReason-Qwen2.5-3B-Verilog-RTL-GRPO-reasoning-tb
21
 
@@ -80,6 +82,19 @@ The GRPO (Generative Reinforcement Learning from Preference Optimization) traini
80
  ```
81
 
82
  ## Citation
 
 
 
 
 
 
 
 
 
 
 
 
 
83
 
84
  ## Acknowledgement
85
  This repo benefits from OpenR1 and LLamaFactory.
 
16
 
17
  For implementation details, visit our GitHub repository: [VeriReason](https://github.com/NellyW8/VeriReason)
18
 
19
+ Check out our paper: [VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation](https://arxiv.org/abs/2505.11849)
20
+
21
  ## Update Log
22
  2025.05.17: Initial release of VeriReason-Qwen2.5-3B-Verilog-RTL-GRPO-reasoning-tb
23
 
 
82
  ```
83
 
84
  ## Citation
85
+ Please cite our paper if you use our model or dataset:
86
+
87
+ ```bibtex
88
+ @misc{wang2025verireasonreinforcementlearningtestbench,
89
+ title={VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation},
90
+ author={Yiting Wang and Guoheng Sun and Wanghao Ye and Gang Qu and Ang Li},
91
+ year={2025},
92
+ eprint={2505.11849},
93
+ archivePrefix={arXiv},
94
+ primaryClass={cs.AI},
95
+ url={https://arxiv.org/abs/2505.11849},
96
+ }
97
+ ```
98
 
99
  ## Acknowledgement
100
  This repo benefits from OpenR1 and LLamaFactory.