Co-Designing Communication Middleware And Deep Learning Frameworks For High-Performance Dnn Training On Hpc Systems