DeepAxion committed · Commit b1f1815 (verified) · Parent: 633aeab

Update README.md

Files changed (1): README.md (+26 -3)
README.md CHANGED
```
print(f"Confidence (Negative): {probabilities[0][0].item():.4f}")
print(f"Confidence (Positive): {probabilities[0][1].item():.4f}")
```
## Training Details

### Training Data

The model was fine-tuned on the IMDb Large Movie Review Dataset. This dataset consists of 50,000 highly polar movie reviews (25,000 for training, 25,000 for testing), labeled as either positive or negative. Reviews with a score of <= 4 out of 10 are labeled negative, and those with a score of >= 7 out of 10 are labeled positive.

Dataset Card: https://huggingface.co/datasets/ajaykarthick/imdb-movie-reviews (or the official IMDb dataset link if different)
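For reference, the dataset can be pulled with the `datasets` library. A minimal sketch, assuming the canonical `"imdb"` Hub ID rather than the mirror linked above:

```python
from datasets import load_dataset

# Assumption: "imdb" is the Hub ID for the Large Movie Review Dataset;
# the card above links a mirror, so adjust the ID if needed.
dataset = load_dataset("imdb")

print(dataset)              # train/test splits of 25,000 labeled reviews each
print(dataset["train"][0])  # {"text": ..., "label": 0 (negative) or 1 (positive)}
```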
### Preprocessing

Text was tokenized using the DistilBertTokenizerFast associated with the base model. Input sequences were truncated to a maximum length of 512 tokens and padded to the longest sequence in the batch. Labels were mapped to 0 for negative and 1 for positive.
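A minimal sketch of this preprocessing, assuming `distilbert-base-uncased` as the base checkpoint (the excerpt names the tokenizer class but not the checkpoint), with per-batch padding handled by a data collator:

```python
from transformers import DataCollatorWithPadding, DistilBertTokenizerFast

# Assumption: distilbert-base-uncased is the base checkpoint.
tokenizer = DistilBertTokenizerFast.from_pretrained("distilbert-base-uncased")

def tokenize(batch):
    # Truncate to at most 512 tokens; padding is deferred to the collator.
    return tokenizer(batch["text"], truncation=True, max_length=512)

# IMDb labels are already 0 = negative, 1 = positive, matching the card.
tokenized = dataset.map(tokenize, batched=True)  # dataset from the sketch above

# Pads each batch to its longest sequence at collation time.
collator = DataCollatorWithPadding(tokenizer=tokenizer)
```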
### Training Hyperparameters

- Training regime: fp16 mixed precision, likely used for faster training and a reduced memory footprint (unconfirmed)
- Optimizer: AdamW
- Learning Rate: a learning-rate scheduler was used; the initial rate is not specified
- Epochs: 3
- Batch Size: 8
- Hardware: Google Colab A100 GPU
- Framework: PyTorch
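Put together, a training setup consistent with those values might look like the sketch below. The `Trainer` API is one plausible harness (the card does not say which loop was used), and its default AdamW-plus-linear-schedule pairing matches the optimizer and scheduler notes above:

```python
from transformers import (
    DistilBertForSequenceClassification,
    Trainer,
    TrainingArguments,
)

# Assumption: distilbert-base-uncased as the base checkpoint,
# with a fresh 2-label classification head.
model = DistilBertForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)

args = TrainingArguments(
    output_dir="distilbert-imdb-sentiment",  # hypothetical output path
    num_train_epochs=3,                      # Epochs: 3
    per_device_train_batch_size=8,           # Batch Size: 8
    fp16=True,                               # mixed precision (unconfirmed above)
    # Trainer defaults to AdamW with a linear learning-rate schedule,
    # consistent with the optimizer and scheduler notes above.
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],  # from the preprocessing sketch
    data_collator=collator,
)
trainer.train()
```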
### Speeds, Sizes, Times

Training Time: [E.g., approximately 1-2 hours on a single Colab T4 GPU] (estimate; not precisely recorded)

Model Size: The model.safetensors file is approximately 255 MB.
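The size is easy to verify by fetching the weights file directly. A sketch with a hypothetical repo ID, since this excerpt does not name the repository:

```python
import os

from huggingface_hub import hf_hub_download

# Hypothetical repo ID; substitute the actual model repository.
path = hf_hub_download("DeepAxion/distilbert-imdb-sentiment", "model.safetensors")
print(f"model.safetensors: {os.path.getsize(path) / 1e6:.0f} MB")  # ~255 MB per the card
```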