Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
Why do you think that https://github.com/NVIDIA/Megatron-LM is a good alternative to paxml