Update README.md
Browse files
README.md
CHANGED
|
@@ -2,26 +2,45 @@
|
|
| 2 |
library_name: keras
|
| 3 |
tags:
|
| 4 |
- structured-data-classification
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
## Model description
|
| 8 |
|
| 9 |
More information needed
|
|
|
|
|
|
|
|
|
|
| 10 |
|
| 11 |
-
|
| 12 |
|
| 13 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
|
| 15 |
-
##
|
|
|
|
| 16 |
|
| 17 |
-
|
|
|
|
|
|
|
| 18 |
|
| 19 |
## Training procedure
|
| 20 |
-
|
| 21 |
### Training hyperparameters
|
| 22 |
|
| 23 |
The following hyperparameters were used during training:
|
| 24 |
-
- optimizer:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
- training_precision: float32
|
| 26 |
|
| 27 |
## Training Metrics
|
|
|
|
| 2 |
library_name: keras
|
| 3 |
tags:
|
| 4 |
- structured-data-classification
|
| 5 |
+
- transformer
|
| 6 |
---
|
| 7 |
|
| 8 |
## Model description
|
| 9 |
|
| 10 |
More information needed
|
| 11 |
+
### Keras Implementation of Structured data learning with TabTransformer
|
| 12 |
+
This repo contains the trained model of [Structured data learning with TabTransformer](https://keras.io/examples/structured_data/tabtransformer/#define-dataset-metadata).
|
| 13 |
+
The full credit goes to: [Khalid Salama](https://www.linkedin.com/in/khalid-salama-24403144/)
|
| 14 |
|
| 15 |
+
Spaces Link:
|
| 16 |
|
| 17 |
+
### Model summary:
|
| 18 |
+
- The trained model uses self-attention based Transformers structure following by multiple feed forward layers in order to serve supervised and semi-supervised learning.
|
| 19 |
+
- The model's inputs can contain both numerical and categorical features.
|
| 20 |
+
- All the categorical features will be encoded into embedding vector with the same number of embedding dimensions, before adding (point-wise) with each other and feeding into a stack of Transformer blocks.
|
| 21 |
+
- The contextual embeddings of the categorical features after the final Transformer layer, are concatenated with the input numerical features, and fed into a final MLP block.
|
| 22 |
+
- A SoftMax function is applied at the end of the model.
|
| 23 |
|
| 24 |
+
## Intended uses & limitations:
|
| 25 |
+
- This model can be used for both supervised and semi-supervised tasks on tabular data.
|
| 26 |
|
| 27 |
+
## Training and evaluation data:
|
| 28 |
+
- This model was trained using the [United States Census Income Dataset](https://archive.ics.uci.edu/ml/datasets/census+income) provided by the UC Irvine Machine Learning Repository. The task of the dataset is to predict whether a person is likely to be making over USD 50,000 a year (binary classification).
|
| 29 |
+
- The dataset consists of 14 input features: 5 numerical features and 9 categorical features.
|
| 30 |
|
| 31 |
## Training procedure
|
|
|
|
| 32 |
### Training hyperparameters
|
| 33 |
|
| 34 |
The following hyperparameters were used during training:
|
| 35 |
+
- optimizer: 'AdamW'
|
| 36 |
+
- learning_rate: 0.001
|
| 37 |
+
- weight decay: 1e-04
|
| 38 |
+
- loss: 'sparse_categorical_crossentropy'
|
| 39 |
+
- beta_1: 0.9
|
| 40 |
+
- beta_2: 0.999
|
| 41 |
+
- epsilon: 1e-07
|
| 42 |
+
- epochs: 50
|
| 43 |
+
- batch_size: 16
|
| 44 |
- training_precision: float32
|
| 45 |
|
| 46 |
## Training Metrics
|