An LPU Inference Engine, with LPU standing for Language Processing Unit™, is a new type of end-to-end processing unit system that provides the fastest inference available, at roughly 500 tokens per second.
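For context on the headline figure: tokens per second is simply the number of generated tokens divided by the wall-clock time the generation took. A minimal sketch of how one might measure this against any streaming inference endpoint (the `generate` callable here is a hypothetical stand-in, not a real API):

```python
import time
from typing import Callable, Iterable

def measure_throughput(generate: Callable[[str], Iterable[str]], prompt: str) -> float:
    """Count streamed tokens and divide by elapsed wall-clock seconds."""
    start = time.perf_counter()
    n_tokens = sum(1 for _token in generate(prompt))
    elapsed = time.perf_counter() - start
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return n_tokens / elapsed

# Illustrative only: a fake generator standing in for a model endpoint.
def fake_generate(prompt: str) -> Iterable[str]:
    for _ in range(500):
        yield "tok"

rate = measure_throughput(fake_generate, "hello")
```

At ~500 tokens/second, a 500-token reply would stream back in about one second, which is why the latency difference is so noticeable in interactive use.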
Replies
This seems extremely interesting. I'm curious what you've seen to be the biggest use case for this LLM?
It is fast, that is for sure. Where can I get more information about the chips and hardware? Is there a GPU cloud service?