π—šπ—£π—§-πŸ°π—© based agent // can work on your computer instead of you

sbagency
Nov 8, 2023

--

https://cdn.openai.com/papers/GPTV_System_Card.pdf
https://twitter.com/josh_bickett/status/1721975391047589934

We used GPT-4Vision to create a self-operating computer.

It can look at any UI and determine what clicks and keystrokes are needed to accomplish a task.

In this demo, we ask it to write a poem in Apple Notes.

GPT-4 viewed what was on the screen, found the open note, and then started clicking and typing the poem inside it.

We’re just beginning to scratch the surface of what’s possible by combining language, vision, and automation in a single AI system.

The future is here and we’re building it.

Shoutout to our team’s new dad Josh Bickett, pushing boundaries with this demo β€” all while balancing family time with his newborn at home! [link]

--

--

sbagency
sbagency

Written by sbagency

Tech/biz consulting, analytics, research for founders, startups, corps and govs.

No responses yet