Search

To excel at engineering design, generative AI must learn to innovate, study finds

AI models that prioritize similarity falter when asked to design something completely new.

‘How’s the Air?’ Using AI to Track Coal Train Dust

Scientists in California are working with communities — and a suite of AI tools — to better understand air pollution.

Learn Your Tokens: Word-Pooled Tokenization for Language Modeling

Language models typically tokenize text into subwords, using a deterministic, hand-engineered heuristic of combining characters into longer surface-level strings such as 'ing' or whole words. Recent literature has repeatedly shown the limitations of such a tokenization strategy, particularly for…

Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs

Knowledge graph entity typing (KGET) aims at inferring plausible types of entities in knowledge graphs. Existing approaches to KGET focus on how to better encode the knowledge provided by the neighbors and types of an entity into its representation. However, they ignore the semantic knowledge...

AI Alone Won’t Solve the Problem of Antibiotic Resistance

Replacing frontline workers with AI can be a bad idea — here’s why

Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs

Given that a conventional laparoscope only provides a two-dimensional (2-D) view, the detection and diagnosis of medical ailments can be challenging. To overcome the visual constraints associated with laparoscopy, the use of laparoscopic images and videos to reconstruct the three-dimensional (3-D)...

How Much Consistency Is Your Accuracy Worth?

Contrast set consistency is a robustness measurement that evaluates the rate at which a model correctly responds to all instances in a bundle of minimally different examples relying on the same knowledge. To draw additional insights, we propose to complement consistency with relative consistency --...

Explained: Generative AI

How do powerful generative AI systems like ChatGPT work, and what makes them different from other types of artificial intelligence?

Biden’s executive order puts civil rights in the middle of the AI regulation discussion

On Oct. 4, 2022, the White House Office of Science and Technology Policy released the Blueprint for an AI Bill of Rights: A Vision for Protecting Our Civil Rights in the Algorithmic Age. The blueprint launched a conversation about how artificial intelligence innovation can proceed under multiple…

Unveiling Safety Vulnerabilities of Large Language Models

As large language models become more prevalent, their possible harmful or inappropriate responses are a cause for concern. This paper introduces a unique dataset containing adversarial examples in the form of questions, which we call AttaQ, designed to provoke such harmful or inappropriate responses...

OtterHD: A High-Resolution Multi-modality Model

In this paper, we present OtterHD-8B, an innovative multimodal model evolved from Fuyu-8B, specifically engineered to interpret high-resolution visual inputs with granular precision. Unlike conventional models that are constrained by fixed-size vision encoders, OtterHD-8B boasts the ability to...

Long hours and low wages: the human labour powering AI’s development

The Finnish tech firm Metroc recently began using prison labour to train a large language model to improve artificial intelligence (AI) technology. For 1.54 euros an hour prisoners answer simple questions about snippets of text in a process known as data labelling.

Data labelling is often…

Using AI to optimize for rapid neural imagin

MIT CSAIL researchers combine AI and electron microscopy to expedite detailed brain network mapping, aiming to enhance connectomics research and clinical pathology.

Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks

We explore the abstract reasoning abilities of text-only and multimodal versions of GPT-4, using the ConceptARC benchmark [10], which is designed to evaluate robust understanding and reasoning with core-knowledge concepts. We extend the work of Moskvichev et al. [10] by evaluating GPT-4 on more...

How AI could reveal secrets of thousands of handwritten documents – from medieval manuscripts to hieroglyphics

Over the last ten years, researchers have gradually been working out how to teach computers to read handwritten documents. As in most machine learning, a computer is fed training data: in this case, images of handwriting and details of what it says. It then learns how the marks on each page…

New method uses crowdsourced feedback to help train robots

Human Guided Exploration (HuGE) enables AI agents to learn quickly with some help from humans, even if the humans make mistakes.

Filter bubbles and affective polarization in user-personalized large language model outputs

Echoing the history of search engines and social media content rankings, the advent of large language models (LLMs) has led to a push for increased personalization of model outputs to individual users. In the past, personalized recommendations and ranking systems have been linked to the development...

Towards Publicly Accountable Frontier LLMs: Building an External Scrutiny Ecosystem under the ASPIRE Framework

With the increasing integration of frontier large language models (LLMs) into society and the economy, decisions related to their training, deployment, and use have far-reaching implications. These decisions should not be left solely in the hands of frontier LLM developers. LLM users, civil society...

AI accelerates problem-solving in complex scenarios

A new, data-driven approach could lead to better solutions for tricky optimization problems like global package routing or power grid operation.

Stay in the loop

Subscribe to our newsletter for a weekly update on the latest podcast, news, events, and jobs postings.