Will there be AI accelerator cards in the future of gaming PCs?

Port8080@feddit.de · 1 year ago

Will there be AI accelerator cards in the future of gaming PCs?

colournoun@beehaw.org · 1 year ago

Unless the AI processing is much more specialized than graphics, I think manufacturers would put that effort into making more powerful GPUs that can also be used for AI tasks.

TheTrueLinuxDev@beehaw.org · 1 year ago

They would try to alleviate the cost on running GPU by making an AI accelerator chip like Tensor Core, but it’ll get bottleneck by limited VRAM when Neural Net models require steep amount of memory. it’s more productive to have something like NPU that runs either on RAM or by it’s own memory chips offering higher amount of capacity to run such neural net and avoid the roundtrip data copying between GPU and CPU.

Qazwsxedcrfv000@lemmy.unknownsys.com · 1 year ago

Your GPU is an AI accelerator already. Running trained AI models is not as resource demanding as training one. Unless local training becomes universal, AI acclerators for consumers make very few sense.

SkyeStarfall@lemmy.blahaj.zone · 1 year ago

The newest gen GPUs have sections dedicated to AI already, so we effectively already have dedicated AI accelerators.

Qazwsxedcrfv000@lemmy.unknownsys.com · 1 year ago

Yes there are but the op is talking about discrete AI accelerators…

dill@lemmy.one · 1 year ago

It was before my time but… If physX cards are any indication, then no.

Port8080@feddit.de · edit-2 1 year ago

The PhysX debate was also before my time. But I read into it, and it seems like they solved it partly software based. Please correct me if I’m wrong, I just skimmed over the PPU subject. But with AI we are talking about hardware limitations, especially memory.

Currently, AI operations mean a lot of time-consuming copy tasks between CPU and GPU.

lilduck@lemm.ee · 25 days ago

What is the significance of semiconductors in chatbot technology?How do semiconductors enhance chatbot capabilities?Can chatbots powered by semiconductors understand and respond to human emotions?What role do semiconductors play in voice-based interactions with chatbots?

Tips: the datasheet (https://www.icdrex.com/the-future-of-communication-chatbots-powered-by-semiconductors/) may help you a little.

TheTrueLinuxDev@beehaw.org · 1 year ago

Absolutely, I would suggest looking into two separate devices that focuses solely on AI acceleration:

Analog Neural Accelerator

and

Neural Processing Unit

Port8080@feddit.de · 1 year ago

Two very interesting articles. Thank you for that!

Especially the analog processor is a game changer with having the computation directly in memory. Generally, analog computers are a very interesting subject!

averyminya@beehaw.org · 1 year ago

Look into what Mystic AI was doing. It’s effectively what you were talking about but based in reality :)

maynarkh@feddit.nl · 1 year ago

Good question, but I’d say that the same train of thought went through dedicated physics cards. I’d guess that an AI card should have a great value proposition to be worth buying.

For compatibility, they could e.g. offer a cloud based subscription system.

I’m not sure where you’re going with this, but it feels wrong. I’m not buying a hardware part that cannot function without a constant internet connection or regular payment.

Port8080@feddit.de · 1 year ago

I’d guess that an AI card should have a great value proposition to be worth buying.

Sure the card should have great value or must have an accessible price. It probably also depends on how “heavy” the tasks get. But seeing e.g. OpenAI struggling with requests, it may be useful to decentralize the processing (with running the model locally on the user’s pc).

I’m not sure where you’re going with this, but it feels wrong. I’m not buying a hardware part that cannot function without a constant internet connection or regular payment.

Maybe this statement was a bit confusing. What I meant was, that in a transition phase developers could choose to allow the usage of a dedicated accelerator card to run everything locally and offline. And for people who don’t have or want such a card they could provide a cloud based subscription model, where the processing is done on remote servers.