Great article by the way, both part 1 and part 2,but is it due to the speed of the computation or any other factors that we migrate from typical keras neutral networks to implement the same in pytorch?
And I’m lost near the CUDA concepts in the codes above, may I know what are they?
Do we need special hardware to run pytorch?
Thanks