Deepmind – Page 21

An empirical analysis of compute-optimal large language model training

April 12, 2022

by Deepmind

We ask the question: “What is the optimal model size and number of training tokens for a given compute budget?” To answer this question, we train models of various sizes and with various numbers of tokens, and estimate this trade-off empirically. Our main finding is that the current large language models are far too large for their compute budget and are not being trained on enough data.Read More

An empirical analysis of compute-optimal large language model training

April 12, 2022

by Deepmind

GopherCite: Teaching language models to support answers with verified quotes

March 16, 2022

by Deepmind

Language models like Gopher can “hallucinate” facts that appear plausible but are actually fake. Those who are familiar with this problem know to do their own fact-checking, rather than trusting what language models say. Those who are not, may end up believing something that isn’t true. This paper describes GopherCite, a model which aims to address the problem of language model hallucination. GopherCite attempts to back up all of its factual claims with evidence from the web.Read More

GopherCite: Teaching language models to support answers with verified quotes

March 16, 2022

by Deepmind

Predicting the past with Ithaca

March 9, 2022

by Deepmind

The birth of human writing marked the dawn of History and is crucial to our understanding of past civilisations and the world we live in today. For example, more than 2,500 years ago, the Greeks began writing on stone, pottery, and metal to document everything from leases and laws to calendars and oracles, giving a detailed insight into the Mediterranean region. Unfortunately, it’s an incomplete record. Many of the surviving inscriptions have been damaged over the centuries or moved from their original location. In addition, modern dating techniques, such as radiocarbon dating, cannot be used on these materials, making inscriptions difficult and time-consuming to interpret.Read More

Predicting the past with Ithaca

March 9, 2022

by Deepmind

Restoring, placing and dating ancient texts through collaboration between AI and historians.Read More

Learning Robust Real-Time Cultural Transmission without Human Data

March 3, 2022

by Deepmind

In this work, we use deep reinforcement learning to generate artificial agents capable of test-time cultural transmission. Once trained, our agents can infer and recall navigational knowledge demonstrated by experts. This knowledge transfer happens in real time and generalises across a vast space of previously unseen tasks.Read More

Learning Robust Real-Time Cultural Transmission without Human Data

March 3, 2022

by Deepmind

Probing Image-Language Transformers for Verb Understanding

February 23, 2022

by Deepmind

Multimodal Image-Language transformers have achieved impressive results on a variety of tasks that rely on fine-tuning (e.g., visual question answering and image retrieval). We are interested in shedding light on the quality of their pretrained representations–in particular, if these models can distinguish verbs or they only use the nouns in a given sentence. To do so, we collect a dataset of image-sentence pairs consisting of 447 verbs that are either visual or commonly found in the pretraining data (i.e., the Conceptual Captions dataset). We use this dataset to evaluate the pretrained models in a zero-shot way.Read More

Probing Image-Language Transformers for Verb Understanding

February 23, 2022

by Deepmind

Vedere AI

Posts in category: Deepmind

An empirical analysis of compute-optimal large language model training

An empirical analysis of compute-optimal large language model training

GopherCite: Teaching language models to support answers with verified quotes

GopherCite: Teaching language models to support answers with verified quotes

Predicting the past with Ithaca

Predicting the past with Ithaca

Learning Robust Real-Time Cultural Transmission without Human Data

Learning Robust Real-Time Cultural Transmission without Human Data

Probing Image-Language Transformers for Verb Understanding

Probing Image-Language Transformers for Verb Understanding

Navigation

GenAI Vision Endless Possibilities

"I'm interested in things that change the world or that affect the future and wondrous, new technology where you see it, and you're like, 'Wow, how did that even happen? How is that possible?'" -- Elon Musk

Copyright © 2019-2025 Vedere AI. All Rights Reserved.