|
--- |
|
license: mit |
|
datasets: |
|
- MuzzammilShah/people-names |
|
language: |
|
- en |
|
model_name: Multi-Layer Perceptron (MLP) Language Model |
|
library_name: pytorch |
|
tags: |
|
- makemore |
|
- mlp |
|
- language-model |
|
- andrej-karpathy |
|
--- |
|
|
|
# Multi-Layer Perceptron Language Model: Makemore (Part 2) |
|
|
|
In this repository, a **Multi-Layer Perceptron (MLP)** language model inspired by the *Bengio et al. (2003)* research paper has been implemented for **character-level predictions**, following Andrej Karpathy's approach in the **Makemore - Part 2** video. |
|
|
|
## Overview |
|
The implementation demonstrates building and training the MLP model for sequence prediction while further enhancing the understanding of neural network architectures for language modeling. |
|
|
|
## Documentation |
|
For a better reading experience and detailed notes, visit my **[Road to GPT Documentation Site](https://muzzammilshah.github.io/Road-to-GPT/Makemore-part2/)**. |
|
|
|
## Acknowledgments |
|
Notes and implementations inspired by the **Makemore - Part 2** video by [Andrej Karpathy](https://karpathy.ai/). |
|
|
|
For more of my projects, visit my [Portfolio Site](https://muhammedshah.com). |