Final update: The developer is now on Mastodon via @andrew_guide.
Update: The developer has removed the ability to download Guide until the security issues mentioned in the linked thread are fixed.
Update: this product contains some code flaws that are concerning from a security perspective, beyond just giving control of your computer to an LLM. You might want to read this thread before installing the product: toot.cafe/@matt/114258349401221651
Update: I've exchanged some long emails with Andrew, the lead developer. He's open to dialogue, and moving the project in the right direction: well-scoped single tasks, more granular controls and permissions, etc. He doesn't strike me as an #AI maximalist can and should do everything all the time kind of guy. He's also investigating deeper screen reader interaction, to let AI just do the things we can't do that it's best at. I stand by my thoughts that the project isn't yet ready for prime time. But as someone else in the thread said, I don't think it should be written off entirely as yet another "AI will save us from inaccessibility" hype train. There is, in fact, something here if it gets polished and scoped a bit more.
Just tried guide for fun. It's supposed to be an app to use #AI to help #blind folks get things done. I asked "Where are the best liver and onions in Ottawa?" It:
1. Decided it needed to search the web.
2. Thought that the "stardew access" icon on my desktop was a kind of web browser, so clicked it.
3. Imagined an "accept cookies" dialogue it needed to accept.
4. Decided that didn't work, so looked for Google Chrome (I don't have chrome installed on that machine)
5. Finally opened edge from the start menu. By the way, it just...left Stardew open and running. Because apparently having Stardew Valley running in the background is a vital part of finding liver and onions in Ottawa.
6. Opened a random extension from my edge toolbar (goodlinks).
7. Clicked the address bar and loaded google.com, instead of just doing the search right from the address bar.
8. Got blocked because it couldn't sign into my Google account, even though it could have also searched from the Google homepage.
To be fair to AI, that was the kind of open-ended task AI is terrible at. If I had asked it to check an inaccessible checkbox, or read a screenshot, or something, I'm sure it would have been fine.
Anyway, I'm still better at using a computer than an AI. So is my 87 year old grandfather, for that matter. www.guideinteraction.com
@fastfinge this kind of thing, where you just give an LLM full access to fuck around on your computer, terrifies me. Even more so that it's designed to be used by people who literally can't see what it's doing. How can anyone think this is safe?
@aburka We can hear exactly what it’s doing with our screen readers. And there is a stop hot key. I’m not concerned about safety. I’m more concerned about privacy, and correctly setting expectations. I work in tech, and I spend at least a couple of hours a day struggling with inaccessible apps and websites, and have to have a co-worker do certain tasks for me. Anything that can make this better is of interest to me. Sure, everyone should build accessible apps. But for the most part, nobody bothers. But I still have to work. However, solutions like this need to be strictly sandboxed to the app I actually need help with, not given access to the entire computer. Not only is this more secure, private, and safer, it also works better. The AI doesn’t need to get distracted by the fact that I have BBalatro installed, or that a windows notification popped up. Just give it the current window.Also, the examples for use need to be more like “Ask me to check a keyboard inaccessible checkbox for you” or “I can help you read an image without alt text”. The current examples are more like “Ask me to book a flight for you”. In its current form, AI can’t do that. The developer has strongly committed to fixing both of these things.
@fastfinge @aburka I agree with this, if it enables me to do tasks I previously couldn’t do before, I’m all for it.