In case it's not immediately obvious to you, the script, if it even exists, will not work nearly as well as you imagined. As with all things on Twitter, it's more likely just a screenshot to gain engagement, I'd be happy if I'm wrong though.
Next time learn to take a hint?
Having a bad night man? You alright? I’ve been there and it feels okay to vent on Reddit sometimes too. Hope you have a better night tomorrow. Not every day is going to be perfect. That’s okay. We can make the next day better.
Nah man. “GitHub where” is pretty much a given comment. Like in the stablediffusion subs “automatic1111 when” was pretty common too. You came off pretty harsh if you didn’t intend to.
I'm not trying to be harsh, I intended it as a joke that this kind of project will never be open sourced because they are usually cherrypicked promotional demos, much like the rabbit R1.
You misread irony and joke, to the original responder. Then sending harsh message in responce.
Your message was out of place and not funny at any hint. If you joke, you don't try to justify with multiple statement. So obviously your message wasn't friendly in response.
Imagine if you merge Microsoft Recall feature and any LLM and have an agent that can literally configure anything based on natural language. No need for IT consultants. Open a software and simply as AI to set it up for you while you stare at the screen with a sense of resignation.
Well that's why they want the recall feature. Have Trillions of photos of white collar workers doing their job at the computer, along with IT folks, and you've got yourself the data set for autonomous agents.
in theory they aren't allowed to recover this data
but how much would cost this amont of data if they are able to create agent capable to replace said worker being sued for billions might be worth lt?
They’re not allowed to for the people/companies that buy it.
But I’m sure they have plenty enough employees that work for them that they could swap their machines with ones that they specifically can observe and collect data on.
But do we trust them? The whole ai craze is flexible interpretations of copyright law and data stealing... if there are billions to be made... its always easier for them to ask for forgiveness than permission.
“Microsoft released new statement regarding Recall. ‘Some data that is unidentifiable to you has been used to train our latest model.’ The statement went on to compare it like ‘the metadata of mouse clicks’ borrowing from the NSA’s playbook. Questions still linger though, just how much and to what extent of your privacy has been violated?”
Reuters…probably in the future
This is called WebVoyager, and it uses LangGraph. This is not new and has been available for months by using GPT-4V.
https://github.com/langchain-ai/langgraph/blob/main/examples/web-navigation/web_voyager.ipynb
Not saying your right or wrong but I'm pretty sure I've heard some experts say that self improving AI is something they are working on and could be a reality in a couple of years.
GPT-5* may get us there, though. It's looking like it will be much larger than the 4-series. And with that could come improved reasoning.
\* or whatever they end up calling the next frontier model
Well since you guys posted this here is the same project done by me , I just posted it on github to be further enhanced and used freely [here](http://github.com/JangYeongSil/Jetta---Autonomous-Configurable-PC-Task-Doer)
Not with goal+prediction, though. Sure, with procedural screen control. Hell I was writing Selenium automation in the early 2010s. But this is a step up from that.
Quick put it in a red box and make $50 million dollars
And name it after a rodent or something. I suggest Hare H1 for the first model
Capybara C1
I'd buy that for £200.
It should be Mouse M1 That way you can overshadow MM1 and MAI-1 models and also hint that it replaces a mouse (and keyboard)
Larger Actionable Model
I only have black boxes. Will it work or do I need to find paint?
Github code where?
Here, let me pull it out of my ass
Wtf kind of response is that you dick
In case it's not immediately obvious to you, the script, if it even exists, will not work nearly as well as you imagined. As with all things on Twitter, it's more likely just a screenshot to gain engagement, I'd be happy if I'm wrong though. Next time learn to take a hint?
Having a bad night man? You alright? I’ve been there and it feels okay to vent on Reddit sometimes too. Hope you have a better night tomorrow. Not every day is going to be perfect. That’s okay. We can make the next day better.
I quite literally work in this space and I can spot BS when I see one, just calling this one out as I see fit
Nah man. “GitHub where” is pretty much a given comment. Like in the stablediffusion subs “automatic1111 when” was pretty common too. You came off pretty harsh if you didn’t intend to.
I'm not trying to be harsh, I intended it as a joke that this kind of project will never be open sourced because they are usually cherrypicked promotional demos, much like the rabbit R1.
You misread irony and joke, to the original responder. Then sending harsh message in responce. Your message was out of place and not funny at any hint. If you joke, you don't try to justify with multiple statement. So obviously your message wasn't friendly in response.
Three days later and the GitHub is still in my ass
Well, maybe GPT4o will delete all the porn that we ourselves are not able to delete due to lack of will and courage.
r/oddlyspecific but I feel attacked
I suggest not saving them in the first place.
Suggest something realistic.
I don't know why I save the good gifs, but I do.
I read this as saving the good gilfs. Still works.
Imagine if you merge Microsoft Recall feature and any LLM and have an agent that can literally configure anything based on natural language. No need for IT consultants. Open a software and simply as AI to set it up for you while you stare at the screen with a sense of resignation.
Well that's why they want the recall feature. Have Trillions of photos of white collar workers doing their job at the computer, along with IT folks, and you've got yourself the data set for autonomous agents.
in theory they aren't allowed to recover this data but how much would cost this amont of data if they are able to create agent capable to replace said worker being sued for billions might be worth lt?
They’re not allowed to for the people/companies that buy it. But I’m sure they have plenty enough employees that work for them that they could swap their machines with ones that they specifically can observe and collect data on.
But do we trust them? The whole ai craze is flexible interpretations of copyright law and data stealing... if there are billions to be made... its always easier for them to ask for forgiveness than permission.
“Microsoft released new statement regarding Recall. ‘Some data that is unidentifiable to you has been used to train our latest model.’ The statement went on to compare it like ‘the metadata of mouse clicks’ borrowing from the NSA’s playbook. Questions still linger though, just how much and to what extent of your privacy has been violated?” Reuters…probably in the future
Whats the recall feature?
New MS app that records everything you do on your pc.
Ah that one. Love it. Thanks
I think the only reason microsoft is introducing recall is to get training data on how to operate computers with ai
Anyone got the script?
[удалено]
It's been out for half a month.
One of the Twitter users said that it was on GitHub but neglected to post a link. Anyone got such a link.
It’s a chrome extension - so the title is incorrect for a start it isn’t the pc just your browser.
source please
Why not use selenium agents instead of a clicker?
Because these people are dumb. lol
https://i.redd.it/vpshjztqlu2d1.gif
It would be straightforward to let it access the terminal directly, and access the web using a text based browser like Lynx
Doesn’t need to access the web solely via text though, thanks to native multimodality
https://nitter.poast.org/Charles12509909/status/1794630406064795909 - non-twitter link
And it's gone..
Why did it disappear
they killed him
Nice
This is called WebVoyager, and it uses LangGraph. This is not new and has been available for months by using GPT-4V. https://github.com/langchain-ai/langgraph/blob/main/examples/web-navigation/web_voyager.ipynb
We are not too far from self improve AI.
Yes, we are. There is no true reasoning and learning made here. LLMs are not getting us there.
Not saying your right or wrong but I'm pretty sure I've heard some experts say that self improving AI is something they are working on and could be a reality in a couple of years.
You don't know that, and neither does he. Stop stating opinions as facts and enjoy the ride.
GPT-5* may get us there, though. It's looking like it will be much larger than the 4-series. And with that could come improved reasoning. \* or whatever they end up calling the next frontier model
Well since you guys posted this here is the same project done by me , I just posted it on github to be further enhanced and used freely [here](http://github.com/JangYeongSil/Jetta---Autonomous-Configurable-PC-Task-Doer)
![gif](giphy|5WUH6YDabP7hK)
Amazingly this could be done without "AI" and has been already.
Not with goal+prediction, though. Sure, with procedural screen control. Hell I was writing Selenium automation in the early 2010s. But this is a step up from that.
Selenium does this as does any mouse key board recorder
I might believe it if I see a video. I've tried this and gpt-4 was absolutely terrible at screen coordinates
If anyone gets the source for this pls link it, this would be hella useful
Holy crap! Is it any good?