kyo kyo-takano

25 followers · 0 following

@kyo_takano

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

kyo-takano / the-poor-mans-guide-to-cloud-gpu-selection.md

Created January 26, 2026 10:53

The Poor Man’s Guide to Cloud GPU Selection

Compute obtained per dollar varies significantly by GPU and arithmetic intensity. According to Runpod's pricing, when pre-training LLMs with `batch_size=1024` (tokens), the L4 offers superior cost-performance for models under 0.5B parameters, while the H100 dominates for larger scales.

The Poor Man’s Guide to Cloud GPU Selection

kyo-takano / making-the-most-of-local-llms.ipynb

Last active November 7, 2025 04:00

ローカルLLMはこーやって使うの💢

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

kyo-takano / introduction-to-ternary-neural-networks.ipynb

Last active October 10, 2025 17:01

introduction-to-ternary-neural-networks.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

yoavg / GM-level-chess-without-search.md

Last active December 26, 2025 00:20

Grand-master Level Chess without Search

Grand-master Level Chess without Search: Modeling Choices and their Implications

Yoav Golderg, February 2024.

Researchers at Google DeepMind released a paper about a learned systems that is able to play blitz-chess at a grandmaster level, without using search. This is interesting and imagination-capturing, because up to now computer-chess systems that play at this level, either based on machine-learning or not, did use a search component.[^1]

Indeed, my first reaction when reading the paper was to tweet wow, crazy and interesting. I still find it crazy and interesting, but upon a closer read, it may not be as crazy and as interesting as I initially thought. Many reactions on twitter, reddit, etc, were super-impressed, going into implications about projected learning abilities of AI systems, the ability of neural networks to learn semantics from observations, etc, which are really over-the-top. The paper does not claim any of them, but they are still perceiv

kyo-takano / few-shot-learning-on-function-calling.ipynb

Last active July 10, 2025 23:32

few-shot-learning-on-function-calling.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.