Thread Easy

Your complete partner for Twitter threads

© 2025 Thread Easy All Rights Reserved.

Explore

Newest first — browse tweet threads

RT @liuzhuang1234: Stronger Normalization-Free Transformers – new paper.

We introduce Derf (Dynamic erf), a simple point-wise layer that l…

Principal Researcher at Tencent HY, Prev. Ph.D. student (学渣) at NUS, focus on parameter generation.

Victor.Kai Wang
Mon Dec 15 15:19:58