Following issue [21](https://github.com/LuxDL/WeightInitializers.jl/issues/21) in WeightInitializers.jl we should do something similar for the initializers that we have here