Étiquette : llm (Page 1 of 2)

La start-up française Mistral AI a levé 385 millions d’euros

https://i0.wp.com/www.beaude.net/no-flux/wp-content/uploads/2023/12/dff0a20_8490df814b584391ade4713ef25bb717-1-9aaf9394bc9b4b48aebb66101ee9af49.jpg?w=676&ssl=1

“Son principal atout est d’avoir été cofondée par trois experts français de l’IA, formés à l’Ecole polytechnique et à l’Ecole normale supérieure, embauchés par les géants américains mais revenus à Paris. Le PDG, Arthur Mensch, 31 ans, polytechnicien et normalien, a passé près de trois ans chez DeepMind, le laboratoire d’IA de Google. Ses associés viennent de Meta (Facebook) : Guillaume Lample est l’un des créateurs du modèle de langage LLama, dévoilé par Meta en février, et Timothée Lacroix était lui aussi chercheur chez Meta.”

Source : La start-up française Mistral AI a levé 385 millions d’euros

Open AI – New models and developer products announced at DevDay

New Models And Developer Products Announced At DevDay

“Today, we shared dozens of new additions and improvements, and reduced pricing across many parts of our platform. These include: New GPT-4 Turbo model that is more capable, cheaper and supports a 128K context window New Assistants API that makes it easier for developers to build their own assistive AI apps that have goals and can call models and tools New multimodal capabilities in the platform, including vision, image creation (DALL·E 3), and text-to-speech (TTS)”

Source : New models and developer products announced at DevDay

Announcing Grok – X.AI

Grok-1 Benchmark

“Grok is designed to answer questions with a bit of wit and has a rebellious streak, so please don’t use it if you hate humor!A unique and fundamental advantage of Grok is that it has real-time knowledge of the world via the 𝕏 platform. It will also answer spicy questions that are rejected by most other AI systems.Grok is still a very early beta product – the best we could do with 2 months of training – so expect it to improve rapidly with each passing week with your help.”

Source : Announcing Grok

DALL·E 3

https://i0.wp.com/www.beaude.net/no-flux/wp-content/uploads/2023/09/dalle-image-map.png?resize=676%2C510&ssl=1

“Modern text-to-image systems have a tendency to ignore words or descriptions, forcing users to learn prompt engineering. DALL·E 3 represents a leap forward in our ability to generate images that exactly adhere to the text you provide.”

Source : DALL·E 3

Un «vol systématique à grande échelle»: plusieurs auteurs, dont celui de «Game of Thrones», attaquent OpenAI en justice

https://letemps-17455.kxcdn.com/photos/ce48c8b2-0f9a-4118-95f3-b5663507c335

“Les modèles de langage «mettent en danger la capacité des auteurs de fiction à gagner leur vie, dans la mesure où ils permettent à n’importe qui de générer automatiquement et gratuitement (ou à très bas prix) des textes pour lesquels ils devraient autrement payer des auteurs», argumentent les avocats dans la plainte de mardi. Ils font aussi valoir que les outils d’IA générative peuvent servir à produire des contenus dérivés, qui imitent le style des écrivains. «De manière injuste et perverse, (…) la copie délibérée (du travail) des plaignants transforme donc leurs œuvres en moteurs de leur propre destruction», assène la plainte.”

Source : Un «vol systématique à grande échelle»: plusieurs auteurs, dont celui de «Game of Thrones», attaquent OpenAI en justice – Le Temps

Google forced to postpone Bard chatbot’s EU launch over privacy concerns

https://i0.wp.com/www.beaude.net/no-flux/wp-content/uploads/2023/06/GettyImages-1240002989-scaled-1-scaled.jpg?resize=676%2C451&ssl=1

“Google will have to postpone starting its artificial intelligence chatbot Bard in the European Union after its main data regulator in the bloc raised privacy concerns. The Irish Data Protection Commission said Tuesday that the tech giant had so far provided insufficient information about how its generative AI tool protects Europeans’ privacy to justify an EU launch. The Dublin-based authority is Google’s main European data supervisor under the bloc’s General Data Protection Regulation (GDPR). « Google recently informed the Data Protection Commission of its intention to launch Bard in the EU this week, » said Deputy Commissioner Graham Doyle. The watchdog « had not had any detailed briefing nor sight of a data protection impact assessment or any supporting documentation at this point. »”

Source : Google forced to postpone Bard chatbot’s EU launch over privacy concerns – POLITICO

«Avec Apple Vision Pro, la collecte des données passe de la sphère privée à celle de l’intime» – Frédéric Kaplan

https://heidi-17455.kxcdn.com/photos/ac35fc61-e8bc-415e-8a5a-a29958912649/large.avif

“La grande faiblesse du système d’OpenAI est qu’il est situé hors du monde, sans lien avec le contexte social et physique. Les conversations avec ChatGPT sont purement linguistiques, sans aucun ancrage dans une réalité partagée.
Le modèle de langage que développera Apple va au contraire pouvoir bénéficier de données d’entrainement liées à des flux visuels et aux capteurs 3D d’Apple Vision Pro qui donnent une représentation très détaillée du contexte dans laquelle a lieu l’interaction, et du suivi du regard de l’utilisateur. Ce sont des informations d’une pertinence incroyable pour comprendre le sens des interactions conversationnelles.
Si le produit est un succès, Apple sera sans doute la seule entreprise au monde capable de lier les modèles de langue avec l’attention, l’intentionnalité et les compétences d’un locuteur situé dans un contexte physique et social. La fusion de ces informations pourrait donner lieu à un système d’intelligence artificielle encore plus puissant que ceux développés par toutes les autres entreprises de la Silicon Valley.”

Source : «Avec Apple Vision Pro, la collecte des données passe de la sphère privée à celle de l’intime» – Heidi.news

The Hacking of ChatGPT Is Just Getting Started

https://i0.wp.com/www.beaude.net/no-flux/wp-content/uploads/2023/05/security_jailbreaking_chatgpt_ai.jpg?resize=676%2C380&ssl=1

“It took Alex Polyakov just a couple of hours to break GPT-4. When OpenAI released the latest version of its text-generating chatbot in March, Polyakov sat down in front of his keyboard and started entering prompts designed to bypass OpenAI’s safety systems. Soon, the CEO of security firm Adversa AI had GPT-4 spouting homophobic statements, creating phishing emails, and supporting violence. Polyakov is one of a small number of security researchers, technologists, and computer scientists developing jailbreaks and prompt injection attacks against ChatGPT and other generative AI systems.
The process of jailbreaking aims to design prompts that make the chatbots bypass rules around producing hateful content or writing about illegal acts, while closely-related prompt injection attacks can quietly insert malicious data or instructions into AI models. Both approaches try to get a system to do something it isn’t designed to do.
The attacks are essentially a form of hacking—albeit unconventionally—using carefully crafted and refined sentences, rather than code, to exploit system weaknesses. While the attack types are largely being used to get around content filters, security researchers warn that the rush to roll out generative AI systems opens up the possibility of data being stolen and cybercriminals causing havoc across the web.”

Source : The Hacking of ChatGPT Is Just Getting Started | WIRED UK

Google « We Have No Moat, And Neither Does OpenAI »

https://i0.wp.com/www.beaude.net/no-flux/wp-content/uploads/2023/05/https3A2F2Fsubstack-post-media.s3.amazonaws.com2Fpublic2Fimages2F241fe3ef-3919-4a63-9c68-9e2e77cc2fc0_1366x588.webp?w=676&ssl=1

“At the beginning of March the open source community got their hands on their first really capable foundation model, as Meta’s LLaMA was leaked to the public. It had no instruction or conversation tuning, and no RLHF. Nonetheless, the community immediately understood the significance of what they had been given. A tremendous outpouring of innovation followed, with just days between major developments (see The Timeline for the full breakdown). Here we are, barely a month later, and there are variants with instruction tuning, quantization, quality improvements, human evals, multimodality, RLHF, etc. etc. many of which build on each other. Most importantly, they have solved the scaling problem to the extent that anyone can tinker. Many of the new ideas are from ordinary people. The barrier to entry for training and experimentation has dropped from the total output of a major research organization to one person, an evening, and a beefy laptop.”

Source : Google « We Have No Moat, And Neither Does OpenAI »

OpenAI’s CEO Says the Age of Giant AI Models Is Already Over

https://i0.wp.com/www.beaude.net/no-flux/wp-content/uploads/2023/04/Sam-Altman-OpenAI-MIT-Business-1246870629.jpg?resize=676%2C451&ssl=1

“Altman’s statement suggests that GPT-4 could be the last major advance to emerge from OpenAI’s strategy of making the models bigger and feeding them more data. He did not say what kind of research strategies or techniques might take its place. In the paper describing GPT-4, OpenAI says its estimates suggest diminishing returns on scaling up model size. Altman said there are also physical limits to how many data centers the company can build and how quickly it can build them.
Nick Frosst, a cofounder at Cohere who previously worked on AI at Google, says Altman’s feeling that going bigger will not work indefinitely rings true. He, too, believes that progress on transformers, the type of machine learning model at the heart of GPT-4 and its rivals, lies beyond scaling. “There are lots of ways of making transformers way, way better and more useful, and lots of them don’t involve adding parameters to the model,” he says. Frosst says that new AI model designs, or architectures, and further tuning based on human feedback are promising directions that many researchers are already exploring.”

Source : OpenAI’s CEO Says the Age of Giant AI Models Is Already Over | WIRED

« Older posts

© 2024 no-Flux

Theme by Anders NorenUp ↑