Memory Models - Search News

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

You can now download Gemma 4 models with quantization-aware training to reduce the amount of mobile memory required to 1GB.

Quantum circuits help AI overcome memory limitations with minimal new parameters

For millions of people, chatbots powered by large language models (LLMs) are now a key feature of everyday life. These AI ...

MeMo's memory model lets teams upgrade their LLM without retraining it — and performance jumps 26%

Researchers' MeMo keeps AI memory separate from reasoning, so teams can upgrade their LLM without retraining it and see a 26% ...

Google’s new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

The company has announced the release of a new Gemma 4 model that fills a gap in the lineup that launched earlier this year.

Geeky Gadgets

LangChain Memory Models : The Future of Conversational AI?

What if your AI could remember every meaningful detail of a conversation—just like a trusted friend or a skilled professional? In 2025, this isn’t a futuristic dream; it’s the reality of ...

Tether Brings AI Memory Compression To Consumer Devices

Tether’s TurboQuant enables useful and powerful local AI applications on consumer devices at much lower costs and without ...

VentureBeat

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

The Manila Times

ZetaChain: The Private Memory Layer for AI

ZetaChain launched Anuma, its first consumer AI product: the private AI that remembers, with one encrypted memory across ...

Geeky Gadgets

AI Memory Hacks: Boosting AI Model Performance with Context

In the fast-paced world of artificial intelligence, memory is crucial to how AI models interact with users. Imagine talking to a friend who forgets the middle of your conversation—it would be ...

Interesting Engineering on MSN

Intel-backed memory tech powers 26-billion-parameter models on PCs with just 16 GB RAM

Phison says its new memory extension technology can run a 26-billion-parameter language model on ...

XDA Developers on MSN

High-VRAM GPUs aren't the future of local AI — unified memory and mixture of experts models are

GPUs are fast, but they have limited RAM. Unified memory machines are big, but they have less bandwidth.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results