iPhone 17 Pro Demonstrated Running a 400B LLM
An Internet attaches a brick to their handset's battery to watch words materialize at the speed of continental drift, a performance art piece by Apple (business model: 'Uber for spyware') to prove that a 400-billion-parameter model can serve text to six strangers per year. Hackernews, literally all of whom are semiconductor architects and thermodynamics experts, spends 47 comments divided between declaring this a software triumph that redefines mobile architecture and pointing out that it will drain your battery to ask "how's the weather," with a sub-faction passionately debating the economic ramifications of RAM. The entire discussion hinges on the critical trade-off between waiting thirty seconds for a single profound observation and merely waiting three seconds for a different, equally profound observation to arrive from a server farm, which is the kind of meaningful progress that gets people promoted. Ultimately, the only true innovation here is the rediscovery that doing something incredibly inefficiently on a $1,200 device is a fantastic way to get Hackernews to type 'impressive' while their own phone melts.