
Conversation

@SauravP97

I was working with micrograd and the loss values were not converging when I was training it on one of my datasets. It looks like the tutorial encourages the use of tanh() as an activation function, but the repo lacked an implementation.

The PR includes the following changes:

  • Implementation of the tanh() activation function, letting end users choose between tanh() and relu() as their activation function when initializing the Multi Layer Perceptron (see the sketch after this list).
  • Support for a label on Value, which helps with debugging when the graph is visualized with Digraph :)
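
For context, here is a minimal sketch (not the exact PR code) of what these two changes could look like, following the Value/Neuron structure of micrograd's engine.py and nn.py. The `label` keyword, the `activation` keyword, and the simplified Neuron are illustrative names for the options described above, not necessarily the PR's actual signatures:

```python
import math
import random

class Value:
    """Minimal scalar autograd node in the spirit of micrograd's engine.py.
    `label` is the optional debugging aid described above; drawing code
    (not shown) can use it to name nodes in a graphviz Digraph."""

    def __init__(self, data, _children=(), _op='', label=''):
        self.data = data
        self.grad = 0.0
        self.label = label
        self._backward = lambda: None
        self._prev = set(_children)
        self._op = _op

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other), '+')
        def _backward():
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other), '*')
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def relu(self):
        out = Value(max(0.0, self.data), (self,), 'relu')
        def _backward():
            self.grad += (out.data > 0) * out.grad
        out._backward = _backward
        return out

    def tanh(self):
        # forward: t = tanh(x); backward: d tanh(x)/dx = 1 - t**2
        t = math.tanh(self.data)
        out = Value(t, (self,), 'tanh')
        def _backward():
            self.grad += (1 - t ** 2) * out.grad
        out._backward = _backward
        return out

class Neuron:
    """One neuron; `activation` is the per-network choice ('relu' or 'tanh')
    that an MLP constructor could pass down through its layers."""

    def __init__(self, nin, activation='relu'):
        self.w = [Value(random.uniform(-1, 1)) for _ in range(nin)]
        self.b = Value(0.0)
        self.activation = activation

    def __call__(self, x):
        act = sum((wi * xi for wi, xi in zip(self.w, x)), self.b)
        return act.tanh() if self.activation == 'tanh' else act.relu()
```

Usage would then look something like `n = Neuron(3, activation='tanh')` and `x = Value(2.0, label='x')`; the label only annotates the node so that a rendered Digraph is easier to read while debugging.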

Please review when you get some time, @karpathy.
Big fan! :)
