I was trying out some source code on the MNIST dataset.
I implemented the same architecture, but the original code trained for 30 epochs and I tried only 5 epochs as a test.
To my surprise, that gave exactly the same results, with 99% accuracy.
Maybe it's because of the Adam optimizer or the larger data volume 🤔
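
A rough back-of-the-envelope check on the data-volume idea: MNIST has 60,000 training images, so even a few epochs already mean thousands of weight updates. The sketch below assumes a batch size of 128, which is a guess since the original code's batch size isn't given here, and `count_updates` is just a hypothetical helper name.

```python
import math

MNIST_TRAIN_SIZE = 60_000  # size of the standard MNIST training split


def count_updates(epochs, batch_size=128, n_samples=MNIST_TRAIN_SIZE):
    """Return the number of optimizer steps taken over `epochs` epochs.

    batch_size=128 is an assumed value, not taken from the original code.
    """
    steps_per_epoch = math.ceil(n_samples / batch_size)
    return epochs * steps_per_epoch


for epochs in (5, 30):
    print(f"{epochs} epochs -> {count_updates(epochs)} weight updates")
```

Under that assumption, 5 epochs is already 2,345 updates, so the model may well have converged long before epoch 30. Logging validation accuracy after each epoch would be one way to check.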