Field Notes on Scaling Moe Expert Parallelism with DeepEP
Posted 9 hours ago by
PaulHoule
1
points
https://nousresearch.com/moe-scaling-field-notes/
0
comments