Deep search
Search
Copilot
Images
Videos
Maps
News
Shopping
More
Flights
Travel
Hotels
Real Estate
Notebook
Top stories
Sports
U.S.
2024 Election
Local
World
Science
Technology
Entertainment
Business
More
Politics
Past hour
Any time
Past 24 hours
Past 7 days
Past 30 days
Best match
Most recent
Hosted on MSN
35m
Can Language Models Stop Making Stuff Up? New OpenAI Benchmark Puts AI to the Test
Research: Measuring short-form factuality in large language models. Image Credit: Shutterstock AI (PDF) In an article ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results
Trending now
Picked as defense secretary
Ratcliffe named CIA director
15 years for Pentagon leaker
US ambassador to Israel
NY judge delays key ruling
Takes job outside WH
Picked as DHS secretary
CA fuel prices skyrocketing?
Yale to offer new course
Ex-Notre Dame coach dies
New research on Uranus
CA noncitizen vote rejected
NYT tech workers end strike
SCOTUS rejects appeal
Tubman honored as general
Emperor penguin found
October deliveries plunge
Flights to Haiti suspended
Dozens killed in China
St. Peter's Basilica uses AI
Pandemic drinking study
Costco butter recalled
Shell wins climate case
Resigns over abuse scandal
LA law blocked temporarily
EPA to charge methane fee
Israel misses aid deadline?
Suspect waives jury trial
Meets w/ president of Israel
Northeast red flag warnings
STD epidemic slows
Related topics
Artificial intelligence
Greg Brockman
Microsoft
Sam Altman
Lilian Weng
Feedback