Bird Identification
YouTube ... Quora ...Google search ...Google News ...Bing News
- Merlin
- Capabilities
- End-to-End Speech ... Synthesize Speech ... Speech Recognition ... Music
- Video/Image ... Vision ... Colorize ... Image/Video Transfer Learning
Photo ID: Merlin Photo ID uses computer vision technology, developed as part of Dr. Grant Van Horn’s doctoral work at Caltech, to identify birds in photos. Photo ID was developed in collaboration with Dr. Pietro Perona’s computational vision lab at Caltech, and Dr. Serge Belongie’s computer vision group at Cornell Tech, collaborators on the Visipedia project. First publicly released Nov 30th, 2017.
Sound ID Sound ID uses recordings archived in the Macaulay Library to learn how to recognize the vocalizations of different bird species. Sound ID is trained on audio recordings that are first converted to visual representations (spectrograms), then analyzed using computer vision tools similar to those that power Photo ID. Dataset preparation began in 2020 with model development starting in early 2021. Sound ID was developed in-house at the Cornell Lab of Ornithology, led by Dr. Grant Van Horn with assistance from Dr. Benjamin Hoffman. S We thank the many annotators that helped curate hundreds of audio recordings for each species. First publicly release June 23rd, 2021.