- cross-posted to:
- technology@lemmy.ml
- technology@lemmy.world
- cross-posted to:
- technology@lemmy.ml
- technology@lemmy.world
cross-posted from: https://lemmy.world/post/25011462
SECTION 1. SHORT TITLE
This Act may be cited as the ‘‘Decoupling America’s Artificial Intelligence Capabilities from China Act of 2025’’.
SEC. 3. PROHIBITIONS ON IMPORT AND EXPORT OF ARTIFICIAL INTELLIGENCE OR GENERATIVE ARTIFICIAL INTELLIGENCE TECHNOLOGY OR INTELLECTUAL PROPERTY
(a) PROHIBITION ON IMPORTATION.—On and after the date that is 180 days after the date of the enactment of this Act, the importation into the United States of artificial intelligence or generative artificial intelligence technology or intellectual property developed or produced in the People’s Republic of China is prohibited.
Currently, China has the best open source models in text, video and music generation.
I could understand banning closed source models but open sourced models that work better than anything propriety isn’t that just the free market that corporations like to pretend to be part of?
Define “open sourced model”.
The neural network is still a black box, with no source (training data) available to build it, not to mention few people have the alleged $5M needed to run the training even if the data was available.
Define “open sourced model”.
The term itself is actually shockingly simple. Source is the original material that was used to build this model, training data and all files that are needed to compile and create the model. It’s Open Source, if these files are available (preferably with an Open Source compatible license). It’s not. We only get binary data, the end result and some intermediate files to fine tune it.
Well its still not Open Source.
Is part of the code not available?
None of the code and training data is available. Its just the usual Huggingface thing, where some weights and parameters are available, nothing else. People repeat DeepSeek (and many other) Ai LLM models being open source, but they aren’t.
They even have a Github source code repository at https://github.com/deepseek-ai/DeepSeek-R1 , but its only an image and PDF file and links to download the model on Huggingface (plus optional weights and parameter files, to fine tune it). There is no source code, and no training data available. Also here is an interesting article talking about this issue: Liesenfeld, Andreas, and Mark Dingemanse. “Rethinking open source generative AI: open washing and the EU AI Act.” The 2024 ACM Conference on Fairness, Accountability, and Transparency. 2024
This literally took one click: https://github.com/deepseek-ai
Stop spreading FUD.
Where’s the training data?
Does open sourcing require you to give out the training data? I thought it only means allowing access to the source code so that you could build it yourself and feed it your own training data.
Open source requires giving whatever digital information is necessary to build a binary.
In this case, the “binary” are the network weights, and “whatever is necessary” includes both training data, and training code.
DeepSeek is sharing:
- NO training data
- NO training code
- instead, PDFs with a description of the process
- binary weights (a few snapshots)
- fine-tune code
- inference code
- evaluation code
- integration code
In other words: a good amount of open source… with a huge binary blob in the middle.
I fear I’ve become something of an accelerationist in the past few days…
yeah, go ahead and pass this, you tech-illiterate xenophobic fucks.
we need to divide and conquer the fascist coalition. make them hate each other. make them consumed by infighting. give them more “oh I didn’t realize there would be negative consequences that affected me personally” moments.
there’s a whole lot of Silicon Valley techbro types who are on board with Musk and Trump because they think it’s all lower taxes, less regulations for their startups, and less “wokeness”. go ahead, pass a law that makes it a federal crime for them to click a GitHub download link. make it so that every Hacker News thread about AI is filled with American engineers bemoaning that they’re legally prohibited from keeping up with the state-of-the-art. make their startups uncompetitive because they’re required by law to pay inflated prices to subsidize OpenAI and other “American-made” plagiarism machines.
deleted by creator