- Institute of Science Tokyo
- Japan
- https://taishi-n324.github.io/
- @Setuna7777_2
- in/taishi-nakamura
Highlights
- Pro
Stars
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
A project to improve the skills of large language models
Letting Claude Code develop its own MCP tools :)
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Search-R1: An Efficient, Scalable RL Training Framework for LLMs that Interleave Reasoning with Search Engine Calls, based on veRL
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Computer gaming agents that run on your PC or laptop.
Official PyTorch implementation for "Large Language Diffusion Models"
Tailor-made Pokémon themes for your Hyper terminal
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Playing Pokemon Red with Reinforcement Learning
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym
DeepEP: an efficient expert-parallel communication library
Everything you want to know about Google Cloud TPU
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters