	---
	license: mit
	datasets:
	- MuzzammilShah/people-names
	language:
	- en
	model_name: Multi-Layer Perceptron (MLP) Language Model
	library_name: pytorch
	tags:
	- makemore
	- mlp
	- language-model
	- andrej-karpathy
	---

	# Multi-Layer Perceptron Language Model: Makemore (Part 2)

	This repository implements a character-level Multi-Layer Perceptron (MLP) language model inspired by Bengio et al. (2003), following Andrej Karpathy's approach in the Makemore - Part 2 video.
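
To make "character-level prediction" concrete, here is a minimal sketch of how training pairs might be built from a list of names: each example is a fixed-size window of previous characters (the context) and the character that follows it. The sample names, the helper names `build_vocab` and `build_dataset`, and the context length `block_size=3` are assumptions for illustration, not taken from this repository.

```python
# Hypothetical sketch: turn names into (context, next-character) pairs.

def build_vocab(words):
    """Map each character to an integer; index 0 is reserved for '.'."""
    chars = sorted(set("".join(words)))
    stoi = {ch: i + 1 for i, ch in enumerate(chars)}
    stoi["."] = 0  # '.' serves as both start padding and end-of-name marker
    return stoi

def build_dataset(words, stoi, block_size=3):
    """Slide a window of `block_size` characters over every name."""
    xs, ys = [], []
    for w in words:
        context = [0] * block_size  # start with padding only
        for ch in w + ".":
            ix = stoi[ch]
            xs.append(list(context))  # input: the previous characters
            ys.append(ix)             # target: the character that follows
            context = context[1:] + [ix]  # shift the window by one
    return xs, ys

words = ["emma", "ava"]  # illustrative names, not the real dataset
stoi = build_vocab(words)
xs, ys = build_dataset(words, stoi)
```

Each name of length n contributes n + 1 examples, because the end-of-name marker is also a prediction target.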

	## Overview
	The implementation walks through building and training the MLP for next-character prediction, with the aim of deepening understanding of neural network architectures for language modeling.
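
As a rough sketch of the kind of model described above, the following PyTorch module embeds each context character, concatenates the embeddings, and passes them through one tanh hidden layer to produce logits over the next character. The class name `CharMLP`, the layer sizes (27-character vocabulary, 10-dimensional embeddings, 200 hidden units, context length 3), and the random batch are assumptions for illustration; the repository's actual code may be organised differently.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CharMLP(nn.Module):
    """Hypothetical Bengio-style MLP: embed, concatenate, tanh, project."""

    def __init__(self, vocab_size=27, block_size=3, emb_dim=10, hidden=200):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.fc1 = nn.Linear(block_size * emb_dim, hidden)
        self.fc2 = nn.Linear(hidden, vocab_size)

    def forward(self, x):
        emb = self.embed(x)                       # (batch, block_size, emb_dim)
        h = torch.tanh(self.fc1(emb.flatten(1)))  # concatenate the embeddings
        return self.fc2(h)                        # logits over next character

torch.manual_seed(0)
model = CharMLP()

# Random stand-in batch; real inputs would be integer-encoded contexts.
x = torch.randint(0, 27, (32, 3))
y = torch.randint(0, 27, (32,))

# One gradient step with plain SGD (learning rate chosen arbitrarily).
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
logits = model(x)
loss = F.cross_entropy(logits, y)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

In practice this step would run in a loop over mini-batches drawn from the encoded names, with the loss tracked on a held-out split.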

	## Documentation
	For a better reading experience and detailed notes, visit my [Road to GPT Documentation Site](https://muzzammilshah.github.io/Road-to-GPT/Makemore-part2/).

	## Acknowledgments
	The notes and implementation are inspired by the Makemore - Part 2 video by [Andrej Karpathy](https://karpathy.ai/).

	For more of my projects, visit my [Portfolio Site](https://muhammedshah.com).