That’s not the actual time if you run it; encoding and decoding are extra.




Nevertheless, it does seem that generation will fairly soon become fast enough to extend a video clip in real time, autoregressively, one second at a time. Integrated with a multimodal input model, you would be very close to an extremely compelling AI avatar.


