Hi! thank you for your open-source code and such an excellent work. I've recently been using your modified CUDA library to compute multi-head attention, but I've noticed that some non-existent issues ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results