File size: 2,901 Bytes

---
library_name: transformers
tags:
- sentence-classification
- korean
- multi-class
- ko-sroberta
- transformers
---

# Model Card for Sentence Type Classification

This model is fine-tuned to classify Korean financial sentences into four categories: Predictive, Inferential, Factual, and Conversational. It is built upon `jhgan/ko-sroberta-multitask`, a multilingual transformer model specialized for Korean NLP tasks.

## Model Details

### Model Description

- **Developed by:** Kwon Cho
- **Shared by:** kwoncho
- **Model type:** RoBERTa-based transformer (fine-tuned for sequence classification)
- **Language(s):** Korean (한국어)
- **License:** Apache 2.0 (from base model)
- **Finetuned from model:** [`jhgan/ko-sroberta-multitask`](https://huggingface.co/jhgan/ko-sroberta-multitask)

This model was fine-tuned for multi-class classification using supervised learning with Hugging Face Transformers and PyTorch.

### Model Sources

- **Repository:** [More Information Needed]
- **Demo:** [More Information Needed]

## Uses

### Direct Use

The model can be used to classify financial sentences (in Korean) into one of the following categories:
- **Predictive** (예측형)
- **Inferential** (추론형)
- **Factual** (사실형)
- **Conversational** (대화형)

### Training Data

- **Dataset Name:** 문장 유형(추론, 예측 등) 판단 데이터  
- **출처:** [AIHub 링크](https://www.aihub.or.kr/aihubdata/data/view.do?pageIndex=1&currMenu=115&topMenu=100&srchOptnCnd=OPTNCND001&searchKeyword=예측형&srchDetailCnd=DETAILCND001&srchOrder=ORDER001&srchPagePer=20&srchDataRealmCode=REALM002&aihubDataSe=data&dataSetSn=71486)

이 데이터는 한국어 금융 문장을 다음 네 가지 유형으로 분류합니다:
- `예측형 (Predictive)`
- `추론형 (Inferential)`
- `사실형 (Factual)`
- `대화형 (Conversational)`

### Out-of-Scope Use

- Not suitable for general-purpose Korean sentence classification outside financial or economic contexts.
- May not perform well on informal or highly colloquial text.

## Bias, Risks, and Limitations

- The model may carry biases present in the training dataset.
- Misclassifications could have downstream implications if used for investment recommendations or financial analysis without verification.

### Recommendations

Use this model in conjunction with human oversight, especially for high-stakes or production-level applications.

## How to Get Started with the Model

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("kwoncho/sentence_type_classification")
model = AutoModelForSequenceClassification.from_pretrained("kwoncho/sentence_type_classification")

text = "해당 종목은 단기적으로 하락할 가능성이 있습니다."
inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True)
outputs = model(**inputs)