AI2's new open-source LLM may reset the definition of open AI
AI2 has released a new large language model (OLMo 7B) and made all software components and training data available on GitHub and Hugging Face.
The goal is to give the AI research community full visibility into the model, enabling researchers to advance natural language processing and improve existing models.
This move aims to address the challenge of attributing specific outputs by an LLM to training data, allowing researchers to understand and evaluate model behavior. [ more ]
Google Brings Gemini Nano to Chrome to Enable On-Device Generative AI
Google announced plans to bring on-device large language models, like Gemini Nano, to Chrome for better privacy, reduced latency, offline access, and a hybrid computation approach. [ more ]
The role of a prompt engineer has attracted significant interest because it can command high salaries without requiring a traditional tech background.
Prompt engineer roles are not simply typing questions into a prompt window, but involve designing intricate sequences of prompts to guide powerful language models. [ more ]
Build and Deploy Multiple Large Language Models in Kubernetes with LangChain
Deploying large language model architectures requires a mix of specialized, generic, and externally sourced models to meet various departmental needs. [ more ]
Scientists increasingly using AI to write research papers
Linguistic and statistical analyses of research papers suggest that generative AI may already be writing a significant portion of the scientific literature. [ more ]
AI Model Backed by Asia's Richest Person to Launch in March
India's BharatGPT group, in collaboration with engineering schools and Reliance, is set to launch a ChatGPT-style service named Hanooman for various sectors.
Startups in India are developing open-source AI models tailored to Indian needs, in contrast to Silicon Valley's large language models. [ more ]
Meta's AI chief: LLMs will never reach human-level intelligence
AGI predictions vary widely: some industry leaders suggest AGI could arrive within five years, while others, like Yann LeCun, argue that even human-level AI remains a distant goal.
Current AI systems lack key cognitive capabilities essential for human-like intelligence, such as reasoning, planning, memory, and understanding the physical world. [ more ]
Using the PowerInfer method, language models can be made more efficient by keeping frequently activated neurons on the GPU and offloading the rest to the CPU.
PowerInfer achieves significant efficiency gains over previous methods by exploiting the power-law distribution of neuron activations in language models. [ more ]
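The partitioning idea can be illustrated with a small sketch. This is a simplified illustration only, not PowerInfer's actual implementation: it simulates power-law activation counts and splits neurons between a GPU "hot" set and a CPU "cold" set.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated per-neuron activation counts following a power law:
# a few "hot" neurons fire on most inputs, the long tail rarely fires.
n_neurons = 1000
activation_counts = rng.zipf(2.0, size=n_neurons)

# Place the hottest neurons on the GPU (fast, limited memory)
# and the cold majority on the CPU.
gpu_budget = 100
order = np.argsort(activation_counts)[::-1]  # hottest first
gpu_neurons = set(order[:gpu_budget].tolist())
cpu_neurons = set(order[gpu_budget:].tolist())
```

Because activations are power-law distributed, a small GPU budget covers the neurons responsible for most of the compute, which is the intuition behind PowerInfer's speedups.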
Meta AI has introduced an interactive guide called 'Prompt Engineering with Llama 2' to elevate the skills of developers, researchers, and enthusiasts in the domain of large language models.
The guide provides hands-on experience in prompt engineering, which involves crafting inputs to guide language models to produce desired outputs. [ more ]
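As a flavor of what such prompt crafting looks like, here is a minimal sketch of Llama 2's chat template (the `[INST]` and `<<SYS>>` markers are Llama 2's documented format; the helper function name and messages are illustrative):

```python
def build_llama2_prompt(system_message: str, user_message: str) -> str:
    """Wrap a system and user message in Llama 2's chat template:
    an [INST] ... [/INST] block with an embedded <<SYS>> section."""
    return (
        f"<s>[INST] <<SYS>>\n{system_message}\n<</SYS>>\n\n"
        f"{user_message} [/INST]"
    )

prompt = build_llama2_prompt(
    "You are a concise assistant.",
    "Summarize proxy-tuning in one sentence.",
)
```

Getting these delimiters right matters: instruction-tuned models are trained on exactly this structure, and malformed templates degrade output quality.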
Researchers Introduce Proxy-Tuning: An Efficient Alternative to Finetuning Large Language Models
Researchers have introduced proxy-tuning, an efficient method for adapting large pretrained LMs.
Proxy-tuning is a lightweight decoding-time algorithm: a smaller language model is tuned, and the difference between its predictions and those of its untuned counterpart is applied to shift the large model's predictions toward the desired behavior. [ more ]
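The decoding-time arithmetic described above can be sketched in a few lines. This is a toy illustration of the logit-offset idea under the assumption of a shared vocabulary, not the authors' implementation:

```python
import numpy as np

def softmax(logits):
    z = logits - logits.max()  # stabilize before exponentiating
    e = np.exp(z)
    return e / e.sum()

def proxy_tuned_distribution(base_logits, expert_logits, antiexpert_logits):
    """Steer a large base model at decoding time by adding the logit
    difference between a small tuned 'expert' and its untuned counterpart."""
    return softmax(base_logits + (expert_logits - antiexpert_logits))

# Toy vocabulary of 4 tokens; the small expert was tuned to prefer token 2.
base = np.array([2.0, 1.0, 0.5, 0.1])
expert = np.array([0.2, 0.1, 3.0, 0.1])
antiexpert = np.array([0.2, 0.1, 0.3, 0.1])

p = proxy_tuned_distribution(base, expert, antiexpert)
```

The base model's own preference (token 0) is overridden by the expert/anti-expert offset, without ever touching the large model's weights.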
Kong's new open source AI Gateway makes building multi-LLM apps easier | TechCrunch
Kong is launching an open source AI Gateway as an extension of its existing API gateway to integrate applications with large language models.
The AI Gateway includes features for prompt engineering, credential management, and more to make building on AI more productive for developers. [ more ]
Learn About LLMs With These ODSC East 2024 Sessions
Large Language Models (LLMs) are transforming the world and the field of data science at an unprecedented pace.
The ODSC East conference offers training sessions and workshops focused on LLMs, including topics like NLP with GPT-4 and enabling complex reasoning with LLMs. [ more ]
Stability AI Releases 1.6 Billion Parameter Language Model Stable LM 2
Stability AI has released pre-trained model weights for the Stable LM 2 language model, a 1.6B parameter model trained on 2 trillion tokens of text data from seven languages.
The model is available in two versions: the base model and an instruction-tuned version called Stable LM 2 Zephyr. [ more ]
Google's multimodal large language models, Gemini, now power the conversational experience within Google Ads, making it easier for advertisers to build and scale Search ad campaigns.
The conversational experience in Google Ads uses a chat-based tool that generates relevant ad content, including assets and keywords, based on a website URL. It also suggests images using generative AI.
Beta access to the conversational experience is currently available to English language advertisers in the US and UK, with global access opening up in the next few weeks and plans to expand to additional languages in the future. [ more ]
Meta's AI chief doesn't think AI super intelligence is coming anytime soon, and is skeptical of quantum computing
Yann LeCun believes current AI systems are decades away from reaching sentience and common sense capabilities.
LeCun believes the technology industry's current focus on language models and text data will not be enough to create advanced human-like AI systems. [ more ]
Why it's important to remember that AI isn't human
ChatGPT remains a topic of debate among experts, with opinions ranging from it being a potential threat to civilization to it being a sophisticated auto-complete tool.
The emergence of language models like ChatGPT raises questions about the link between language and the mind, and whether a new form of mind has been created.
Interacting with chatbots can be misleading due to the ambiguity in language, requiring us to rely on our intention-guessing mechanism for effective communication. [ more ]
Silo AI releases checkpoint on mission to democratise LLMs
Large language models work more effectively in English, creating language bias and limiting access to knowledge and innovation in other languages.
Silo AI has released the multilingual open European LLM Poro 34B, which has shown best-in-class performance for low-resource languages like Finnish. [ more ]