The CETI project: understanding the language of sperm whales using artificial intelligence tools

0
The CETI project: understanding the language of sperm whales using artificial intelligence tools

Understanding the language used by animals is one of the challenges facing some scientists: this is the case of Denise Herzing, who studies the language of dolphins through the Wild Dolphin Project. The Cetacean Translation Initiative (CETI) aims to listen to and understand the language of sperm whales. To do this, they are using artificial intelligence and more specifically machine learning to try to decode their songs.

A research project to understand the language of sperm whales

Sperm whales are undoubtedly among the noisiest group of living beings on Earth. To communicate with other sperm whales or other species of whales, this immense mammal produces jerky clicks, clicks or cracks that can be heard by other species sometimes hundreds of kilometres away. Their messages can be brief, lasting only a few seconds, or sometimes more than half an hour.

These songs are known to scientists as “codas” and they wonder if these patterns are complex enough to be considered a language in its own right. As part of the interdisciplinary CETI project, several researchers presented a plan to decode the songs of sperm whales. Their roadmap has beenpublished in a paper by Jacob Andreas of the Massachusetts Institute of Technology (MIT) and Robert J. Wood of the Marine Biology Laboratory and Department at the University of Aarhus, Denmark.

Machine learning, robotics, drones, automated channels: the tools used by the CETI project

At present, the CETI project has a database of recordings corresponding to about 100,000 sperm whale clicks. They have been collected by marine biologists over several years. Despite this relatively large dataset, the machine learning models developed by the researchers may need about 4 million clicks to really understand the songs, as well as its redundancies and particularities.

However, to fill this gap, the CETI project has set up numerous automated channels to collect recordings of sperm whale songs. Underwater microphones placed in areas of the ocean where these mammals typically congregate and dropped by drones that spot a group of sperm whales will be able to accomplish this task. In addition, aquatic robots will be able to track and listen to these marine species discreetly from a distance.

Challenges to the CETI project

Although the project’s researchers are confident, they do not hide the fact that there are difficulties that could put obstacles in their way:

  • It can be difficult to know when a word starts and when it ends. Indeed, just like when we listen to someone speak in a foreign language that we don’t understand, it is very complicated to distinguish each word in a sentence. Machine learning algorithms find audio more difficult to analyze than text.
  • Even though we speak different languages around the world, we often talk about the same news, we all have the same topics of discussion that can come up. So, even if we have difficulty understanding a language, we can still be sure that the words spoken by a person whose language we don’t understand are not far from the words we may say in our daily lives with friends or family. For sperm whales, it’s different because they live in the oceans, a world that we don’t come into contact with much.
  • According to a 2016 study, Shane Gero, Hal Whitehead, and Luke Rendell analyzed the songs of nine different groups of sperm whales. Their conclusion: sperm whales would have different dialects. An additional difficulty to take into account.

Despite these difficulties, the researchers are not giving up and are enjoying the study while hoping that if it works, their methods can be applied to other animals.

Translated from Le projet CETI : comprendre la langue des cachalots grâce à des outils d’intelligence artificielle