AI Week in Review 25.08.16

Aug 16

Gemma 3 270M, SuperFly & Chickbrain / Llama 3.1 8B Slim, LFM2-VL, Matrix-Game 2.0, Matrix-3D, Nvidia Cosmos & Omniverse, Sonnet 4 gets 1 million token context, Jan-v1, gpt-oss-20b-base, CoAct-1.

4 Comments

Arthur Li

I just don’t get the point of 270M models, because most phones can run at least 1B-parameter models. And most phones manufactured after 2024 can run Qwen3-4B-2507, which is 100 times more powerful than that 270M model. For example, my iPhone 15 Pro runs it at 19 tokens per second.

Patrick McGuinness

I kind of agree. 270M is an interesting engineering feat, but many applications don't need that extreme efficiency; a smartphone can run 1B-4B AI models. A model that small could be used for IoT applications, though.

Arthur Li

18h

But I guess a model this small will hallucinate a lot, which is somewhat unacceptable for IoT devices.

Arthur Li

Oh, but it will be great for fine-tuning.

AI Changes Everything
