Large Language Models
It is no secret that I've been very enthusiastic about The Commercial NLP Renaissance a.k.a. Large Language Models (yet a bit skeptical about the type of people its attracting).
I've been building a lot of stuff with them, mainly Printloop, but also BORI.
This week is expected to be big for language models as GPT-4 is rumored to be announced, but also high-performance language models are very close to land in consumer hardware!
Language Computer
I've been looking for ways to build a "Language Computer" for my home; I want to run a service on my own hardware that I can use to build personal language tools, services, and home automation workflows.
I also want to host other AI services at home (like speech-recognition), but my focus is on language models first because I believe it's the most useful.
Turns out, I am very close to making this a reality! We'll soon be able to run a language model with performance comparable to OpenAI's Davinci GPT models on our own hardware.
I highly recommend reading Simon Wilson's "Stanford Alpaca" post discussed the acceleration of large language model development on-device, particularly talking about Stanford Alpaca and llama.cpp.
I already so many ideas for things I will be able to do with my own language service at home:
- Digitize, summarize, categorize, and sort my snail mail
- Digest, summarize, and present the news to me
- An army of mechanical historians that indefinitely browse the web, recording findings about topics that I am interested in, keeps track of trends, and writes me a report every day