Ai2

Team

non-profit

Verified

https://allenai.org/

allen_ai

allenai

AI & ML interests

Building breatkthrough AI to solve the world's biggest problems.

Recent Activity

baileyk updated a dataset about 12 hours ago

allenai/dolma3_mix-6T-1025

faezeb authored a paper 22 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

undfined authored a paper 22 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

View all activity

Papers

Olmo 3

OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation

View all Papers

baileyk

updated a dataset about 12 hours ago

allenai/dolma3_mix-6T-1025

Preview • Updated about 12 hours ago • 28.9k • 14

baileyk

updated a dataset about 13 hours ago

allenai/dolma3_mix-5.5T-1125

Viewer • Updated about 13 hours ago • 218k • 638 • 7

faezeb

authored a paper 22 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 23 days ago • 57

sewon

authored a paper 22 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 23 days ago • 57

pradeepd

authored a paper 22 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 23 days ago • 57

shannons

authored 3 papers 22 days ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Paper • 2406.07835 • Published Jun 10, 2024 • 2

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10 • 15

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 23 days ago • 57

TTTXXX01

authored a paper about 2 months ago

Inference-time Alignment in Continuous Space

Paper • 2505.20081 • Published May 26

yanhong-l

authored a paper about 2 months ago

Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs

Paper • 2510.18279 • Published Oct 21 • 4

valentinhofmann

authored a paper 3 months ago

Large Language Models Discriminate Against Speakers of German Dialects

Paper • 2509.13835 • Published Sep 17 • 7

ashish333

authored 9 papers 5 months ago

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Paper • 1803.05457 • Published Mar 14, 2018 • 3

Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering

Paper • 1809.02789 • Published Sep 8, 2018

From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project

Paper • 1909.01958 • Published Sep 4, 2019

Probing Natural Language Inference Models through Semantic Fragments

Paper • 1909.07521 • Published Sep 16, 2019

QASC: A Dataset for Question Answering via Sentence Composition

Paper • 1910.11473 • Published Oct 25, 2019

What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge

Paper • 1912.13337 • Published Dec 31, 2019

UnifiedQA: Crossing Format Boundaries With a Single QA System

Paper • 2005.00700 • Published May 2, 2020

Language Models with Rationality

Paper • 2305.14250 • Published May 23, 2023

Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy

Paper • 2305.14596 • Published May 24, 2023 • 1