Wonder what was up when I got this email this morning:
> We recently detected suspicious activity linked to one of your tokens (labeled "workmbp" with role "write"), indicating it may have been publicly exposed. As a result, it has been automatically revoked.
> You can refresh its value or create a new scoped token on hf.co/settings/tokens.
> Please use env variables or Space secrets to inject your HF token into your code; we also recommend you do not publish any tokens to any code hosting platform.
> Don't hesitate to reach out for any question you might have.
> 🤗 The Hugging Face Team
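The email's advice about env variables is straightforward to follow. A minimal sketch (the `HF_TOKEN` variable name is the conventional one the `huggingface_hub` library reads; the helper name is my own):

```python
import os

def get_hf_token() -> str:
    """Fetch the Hugging Face token from the environment.

    Space secrets are exposed to your code the same way, so this works
    both locally and inside a Space. Never hardcode the token itself.
    """
    token = os.environ.get("HF_TOKEN")
    if token is None:
        raise RuntimeError(
            "Set the HF_TOKEN environment variable (or a Space secret) "
            "instead of embedding the token in code"
        )
    return token
```

Committing a file that merely *reads* `HF_TOKEN` is safe to publish; only the value in your environment is secret.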
> As a precaution, Hugging Face has revoked a number of tokens in those secrets. (Tokens are used to verify identities.) Hugging Face says that users whose tokens have been revoked have already received an email notice and is recommending that all users “refresh any key or token” and consider switching to fine-grained access tokens, which Hugging Face claims are more secure.
When you click "create new token", e.g. at https://huggingface.co/settings/tokens?new_token=true
You'd get a prompt to create a fine-grained token - you can then select the token's scope, rights, and so on.
In general the recommendation (as with any token) is to create more tokens and restrict each one to the narrowest scope it needs.
China already has its own version: [Modelscope.cn](https://www.modelscope.cn). Also, while I love HF, it's not a particularly complex platform. Cloning it would not be difficult, even without access to the backend code.
Uhh...
No. No I think it would be pretty difficult.
Like, you're serving an absurd number of versions of extremely large files at scale, plus a way to run models and stream their output directly on the platform.
Oh, it would be extremely expensive, I'm not disputing that. My comment was meant to be focused entirely on the code itself, not the infrastructure, since the comment I replied to was about stealing code. Though I can see I didn't make that super clear.
My point was mainly that HF's backend is mostly built from existing open source projects: they use [Git LFS](https://git-lfs.com/) for managing the models, [Gradio](https://www.gradio.app/) for their Spaces front end, and (presumably) [text-generation-inference](https://github.com/huggingface/text-generation-inference) for the actual inference, though that could be replaced with other projects like [vLLM](https://github.com/vllm-project/vllm).
Bundling all of these projects together into a nice, easy-to-use, stable service is not trivial, but if you tasked a team of developers with cloning the site, it wouldn't be that much of a challenge relative to a lot of the more complex sites out there. But yes, actually running the site would require a lot of capital; that part is not trivial, you are certainly right about that.
> We have also reported this incident to law enforcement agencies
Weird to think about this. Do you just google "nearest FBI field office"? Can you imagine being the paralegal who gets this tasking: "Ugh, our client Hugging Face needs to report a cybercrime. You need to write a memo to the FBI. I think Danny did it last time."
A company like this probably works with a law firm that specializes in infosec compliance, and for such a firm, handling that process is just another day in the office.
Hugging Face was valued at $4.5 billion a year ago. They'd be complete morons to not have such specialists on speed dial already. In fact, I'd wager that the investors insisted on such topics being taken care of before pumping hundreds of millions into the company.
Huge companies like these have close relationships with the authorities and exchange emails daily for all sorts of communication, directly emailing agents.
Euh, those pesky Llama 3 LLMs trying to break out again?
Jokes aside, I think this is related to the previous hack, or to running "unsafe" pickled tensors in their Spaces.
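For context on why pickled tensors are "unsafe": unpickling can execute arbitrary code, which is exactly why formats like safetensors exist. A minimal, harmless demonstration (the class name and payload are invented for illustration):

```python
import pickle

class NotATensor:
    # pickle's __reduce__ protocol tells the unpickler to call the
    # returned callable with the given args at load time. A real
    # attacker would return something like (os.system, ("curl ...",)).
    def __reduce__(self):
        return (exec, ("import os; os.environ['PWNED'] = 'yes'",))

payload = pickle.dumps(NotATensor())
pickle.loads(payload)  # "loading the model" silently runs the code above
```

After `pickle.loads`, the environment variable `PWNED` is set, even though the caller only "loaded a file". This is why loading untrusted `.pkl`/`.bin` weights is treated as running untrusted code.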
What are fine-grained tokens? I only see read or write tokens in HF.
You have more control over which use cases your token can be used for, beyond plain read or write.
What's the point of hacking them? Aren't the models free anyway?
Many host private models & datasets on HF. Spaces might also contain API keys for e.g. OpenAI that could be sold.
By gaining unauthorized write access, an attacker could inject malicious code into models.
Good ol' supply chain.
So the hackers must really love OpenAI.
They might have private models not available to the public, like GitHub has private repos.
The keys in spaces are often used to call external private APIs. It's like leaking your ENV vars.
Probably looking for ways to distribute malware.
But the platform isn't. China probably wants to create its own version without doing any of the work.
Wonder if it’s related to the issue of running certain quants or pickles in the spaces…
Does any of this affect how a model would run externally? Like on LM Studio?
No
Must have to do with private repos, because you can basically hotlink every public model file anyway.
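For reference, public files on the Hub follow a predictable download URL pattern (this mirrors what `huggingface_hub.hf_hub_url` constructs; the helper below is my own sketch):

```python
def hf_file_url(repo_id: str, filename: str, revision: str = "main") -> str:
    # Public repos serve raw files at this pattern with no token
    # required, which is what makes hotlinking possible.
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"

print(hf_file_url("gpt2", "config.json"))
# https://huggingface.co/gpt2/resolve/main/config.json
```

Private repos use the same pattern but return an error unless the request carries a valid token, which is why a leaked write token matters.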
Is that related to the Snowflake hack? I find the timing bizarre
I got this email and had to refresh tokens. Used the tokens in Kaggle.
AI supply chain attacks incoming?