Thread Easy

The all-purpose partner for Twitter threads

© 2025 Thread Easy All Rights Reserved.

Explore

Newest first: browse tweet threads


math provers remain my standard meter bar. if goedel hit sota on 32b, this is probably all you need to solve hardest problems at the moment.

Reasoning models coming (very) soon. Co-founder @pleiasfr

Alexander Doria
Sat Nov 01 09:57:54
interestingly, as use cases become more complex and mature, first time i’m becoming constrained by model size. 30b dense or 50-150b active is likely becoming a sweet spot.

Alexander Doria
Sat Nov 01 09:53:42
hyped

(but decidedly won’t escape the big dive in moe finetuning)

Alexander Doria
Sat Nov 01 09:49:03
"I just don't see Cursor or Windsurf having enough cashflow on-hand to build a training cluster and gather all the data and teams you need to build a model from nothing." Meanwhile Chinese food delivery app.

Alexander Doria
Sat Nov 01 09:45:15