Search

Amazon rainforest photosynthesis increases in response to atmospheric dryness

Earth system models predict that increases in atmospheric and soil dryness will reduce photosynthesis in the Amazon rainforest, with large implications for the global carbon cycle. Using in situ observations, solar-induced fluorescence, and nonlinear machine learning techniques, we show that, in...

Power-law scaling to assist with key challenges in artificial intelligence

Power-law scaling, a central concept in critical phenomena, is found to be useful in deep learning, where optimized test errors on handwritten digit examples converge as a power-law to zero with database size. For rapid decision making with one training epoch, each example is presented only once to...

CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning

Recently, large-scale pre-trained language models have demonstrated impressive performance on several commonsense-reasoning benchmark datasets. However, building machines with commonsense to compose realistically plausible sentences remains challenging. In this paper, we present a constrained text...

Understanding Time Series with R

Analyzing time series is such a useful resource for essentially any business, data scientists entering the field should bring with them a solid foundation in the technique. Here, we decompose the logical components of a time series using R to better understand how each plays a role in this type of...

Shrinking massive neural networks used to model language

A new approach could lower computing costs and increase accessibility to state-of-the-art natural language processing.

‘Rules as Code’ will let computers apply laws and regulations. But over-rigid interpretations would undermine our freedoms

Leveraging multi-way interactions for systematic prediction of pre-clinical drug combination effects

We present comboFM, a machine learning framework for predicting the responses of drug combinations in pre-clinical studies, such as those based on cell lines or patient-derived cells. comboFM models the cell context-specific drug interactions through higher-order tensors, and efficiently learns...

Rapid tissue oxygenation mapping from snapshot structured-light images with adversarial deep learning

Significance: Spatial frequency-domain imaging (SFDI) is a powerful technique for mapping tissue oxygen saturation over a wide field of view. However, current SFDI methods either require a sequence of several images with different illumination patterns or, in the case of single-snapshot optical...

Interpretable Forward and Inverse Design of Particle Spectral Emissivity Using Common Machine-Learning Models

Radiative particles are ubiquitous in nature and in various technologies. Calculating radiative properties from known geometry and designs can be computationally expensive, and trying to invert the problem to come up with designs specific to desired radiative properties is even more challenging...

Kraken reveals itself – the merger history of the Milky Way reconstructed with the E-MOSAICS simulations

Globular clusters (GCs) formed when the Milky Way experienced a phase of rapid assembly. We use the wealth of information contained in the Galactic GC population to quantify the properties of the satellite galaxies from which the Milky Way assembled. To achieve this, we train an artificial neural...

First Steps of a Data Science Project

Many data science projects are launched with good intentions, but fail to deliver because the correct process is not understood. To achieve good performance and results in this work, the first steps must include clearly defining goals and outcomes, collecting data, and preparing and exploring the...

Refugees are at risk from dystopian ‘smart border’ technology

Neuroscientists find a way to make object-recognition models perform better

Adding a module that mimics part of the brain can prevent common errors made by computer vision models.

No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a...

Improved guarantees and a multiple-descent curve for Column Subset Selection and the Nyström method

The Column Subset Selection Problem (CSSP) and the Nyström method are among the leading tools for constructing small low-rank approximations of large datasets in machine learning and scientific computing. A fundamental question in this area is: how well can a data subset of size k compete with the...

Language Models are Few-Shot Learners

We demonstrate that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even becoming competitive with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any...

HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent

Stochastic Gradient Descent (SGD) is a popular algorithm that can achieve state-of-the-art performance on a variety of machine learning tasks. Several researchers have recently proposed schemes to parallelize SGD, but all require performance-destroying memory locking and synchronization. This work...

R squared Does Not Measure Predictive Capacity or Statistical Adequacy

The fact that R-squared shouldn't be used for deciding if you have an adequate model is counter-intuitive and is rarely explained clearly. This demonstration overviews how R-squared goodness-of-fit works in regression analysis and correlations, while showing why it is not a measure of statistical...

Building machines that better understand human goals

A new algorithm capable of inferring goals and plans could help machines better adapt to the imperfect nature of human planning.

How humans use objects in novel ways to solve problems

What's SSUP? The Sample, Simulate, Update cognitive model developed by MIT researchers learns to use tools like humans do.

Stay in the loop

Subscribe to our newsletter for a weekly update on the latest podcast, news, events, and jobs postings.