Thread Easy

Your all-in-one partner for Twitter threads

© 2025 Thread Easy All Rights Reserved.

Explore

Browse tweet threads, newest first

RT @liuzhuang1234: Stronger Normalization-Free Transformers – new paper.

We introduce Derf (Dynamic erf), a simple point-wise layer that l…

Principal Researcher at Tencent HY, Prev. Ph.D. student (学渣) at NUS, focus on parameter generation.

Victor.Kai Wang
Mon Dec 15 15:19:58