All activity
Snap is a floating dock for Cursor and Claude Code. Watch productivity reels, take screenshots, speech to text, generate and optimize prompts, copy console errors, visual editing, preview web, and custom action buttons.

SnapThe floating dock for developers
Jaber Jaberleft a comment
built this because i kept alt-tabbing my life away. now everything lives in one dock. try it and lmk what you'd add 🤙

SnapThe floating dock for developers
Forge turns PyTorch models into optimized CUDA and Triton kernels automatically. 32 AI agents run in parallel, each trying different optimization strategies like tensor cores, memory coalescing, and kernel fusion. A judge validates every kernel for correctness before benchmarking. We got 5x faster inference than torch.compile on Llama 3.1 8B and 4x on Qwen 2.5 7B. Works on any PyTorch model. Free trial on one kernel. Full credit refund if we don't beat torch.compile.

Forge AgentSwarm Agents That Turn Slow PyTorch Into Fast GPU Kernels
Jaber Jaberleft a comment
Hey PH! If we don't beat torch.compile you get your credits back!! Real results on B200: Llama 3.1 8B: 5x faster than torch.compile Qwen 2.5 7B: 4x faster SDXL UNet: 3x faster

Forge AgentSwarm Agents That Turn Slow PyTorch Into Fast GPU Kernels
Jaber Jaberleft a comment
2025 was the year of AI agents. 2026 will be the year of swarm agents. Forge is how we're starting it 32 agents competing in parallel to optimize your GPU kernels. Enter a HuggingFace model ID, get optimized CUDA/Triton for every layer. "Full refund if we don't beat torch.compile" Would love your feedback:D

Forge CLISwarm agents optimize CUDA/Triton for any HF/PyTorch model
Forge generates optimized GPU kernels from any PyTorch or HuggingFace model. 32 parallel Coder+Judge agents compete to find the fastest CUDA/Triton implementation. Up to 5× faster than torch.compile(mode='max-autotune') with 97.6% correctness.
Enter HuggingFace model ID, get optimized kernels for every layer. Powered by optimized NVIDIA Nemotron 3 Nano 30B at 250k tokens/sec.
"Full refund if we don't beat torch.compile"

Forge CLISwarm agents optimize CUDA/Triton for any HF/PyTorch model
Jaber Jaberleft a comment
Amazing product from amazing team!!!🔥🔥 Good luck guyss I will definitely try itt

PlanEat AIAI turns your health goals into a 7-day menu & grocery list
Jaber Jaberleft a comment
GPU development shouldn't require switching between five tools just to understand why your kernel is slow RightNow AI is the first GPU-native code editor. It brings real-time profiling, a cycle-accurate emulator, and AI that actually understands GPU kernels—all in one place. profile without leaving your code. emulate H100s without the hardware. ask optimization questions in plain english We...

RightNowAI code editor for GPU kernel development
RightNow AI is the first GPU-native code editor supporting CUDA, Triton, CUTE, and Tilelang. Features agentic hardware-aware AI (Forge), cycle-accurate GPU emulator with 98% accuracy across 86+ NVIDIA architectures, real-time profiling with Nsight Compute integration, and line-by-line performance analysis.
Write, optimize, and profile GPU kernels in any DSL, all in one editor. Supports all NVIDIA GPUs
Free to download with pro features available.

RightNowAI code editor for GPU kernel development
Jaber Jaberleft a comment
Interesting tbh, because i am a technical person this will definitely help me with the commercial side!! Thankss
ZapDigitsQuickly build client-ready marketing dashboards
Jaber Jaberleft a comment
GPU development shouldn't require switching between five tools just to understand why your kernel is slow. RightNow AI is the first GPU-native code editor. It brings real-time profiling, a cycle-accurate emulator, and AI that actually understands CUDA all in one place.Profile without leaving your code. Emulate H100s without the hardware. Ask optimization questions in plain english. Free to...

RightNow CUDA EditorAI code editor for GPU development
RightNow AI is the only AI code editor built specifically for CUDA. Features agentic AI that knows your GPU architecture, real Nsight Compute profiling inline, natural language to NCU commands, and 86+ GPU emulation with <2% error. Benchmark across block sizes, thread counts, and memory layouts. View PTX/SASS assembly side-by-side. Profile multiple GPUs simultaneously. Connect to remote GPUs or run fully local with offline LLM support. Supports all NVIDIA GPUs from GTX 1060 to H100.

RightNow CUDA EditorAI code editor for GPU development
Jaber Jaberleft a comment
I will absolutely give it a try!! Thanks guyss
BeFreedLearn anything with your own personal audio agent
Jaber Jaberleft a comment
Congratulations guyss!! Impressive work

DevReadyKitUI Framework tailored for SaaS & Devtools MVP’s
Jaber Jaberleft a comment
We built RightNow CLI because most coding AIs don’t understand GPUs. They can write Python, but not CUDA, they miss memory coalescing, warp divergence, register pressure, and all the details that make or break GPU performance. RightNow CLI changes that. It’s an AI-powered CLI that can actually reason about GPU architecture. It writes, debugs, and optimizes CUDA kernels natively, no setup, no...

RightNow CLIClaude Code for CUDA, an open-source AI CLI for GPU devs
Claude Code for CUDA. Free AI assistant that actually understands GPU architecture, write, debug, and optimize GPU kernels right from your terminal. Built by RightNow AI, the first GPU-native AI code editor.

RightNow CLIClaude Code for CUDA, an open-source AI CLI for GPU devs








