How big should batch size and number of epochs be when fitting a model?

Since you have a pretty small dataset (~ 1000 samples), you would probably be safe using a batch size of 32, which is pretty standard. It won’t make a huge difference for your problem unless you’re training on hundreds of thousands or millions of observations.

To answer your questions on Batch Size and Epochs:

In general: Larger batch sizes result in faster progress in training, but don’t always converge as fast. Smaller batch sizes train slower, but can converge faster. It’s definitely problem dependent.

In general, the models improve with more epochs of training, to a point. They’ll start to plateau in accuracy as they converge. Try something like 50 and plot number of epochs (x axis) vs. accuracy (y axis). You’ll see where it levels out.

What is the type and/or shape of your data? Are these images, or just tabular data? This is an important detail.

Leave a Comment

Hata!: SQLSTATE[HY000] [1045] Access denied for user 'divattrend_liink'@'localhost' (using password: YES)