AVISARME: Audio Visual Synchronization Algorithm for a Robotic Musician Ensemble

dc.contributor.advisor: Chopra, Nikhil (en_US)
dc.contributor.author: Berman, David Ross (en_US)
dc.contributor.department: Mechanical Engineering (en_US)
dc.contributor.publisher: Digital Repository at the University of Maryland (en_US)
dc.contributor.publisher: University of Maryland (College Park, Md.) (en_US)
dc.date.accessioned: 2012-10-10T11:33:43Z
dc.date.available: 2012-10-10T11:33:43Z
dc.date.issued: 2012 (en_US)
dc.description.abstract: This thesis presents a beat detection algorithm that combines audio and visual inputs to synchronize a robotic musician with its human counterpart. Although considerable work has been done on sophisticated audio beat detection methods, the visual aspect of musicianship has been largely ignored. With advances in image processing techniques and in computing and imaging hardware, it has recently become feasible to integrate visual inputs into beat detection algorithms. The proposed method for audio tempo detection also addresses many shortcomings of current audio-only algorithms, which tend to be inaccurate, computationally expensive, or limited by poor temporal resolution. In experimental testing on both a popular music database and simulated music signals, the proposed algorithm performed statistically better in both accuracy and robustness than the baseline approaches. The proposed approach is also highly efficient, taking only 45 ms to compute on a 2.5 s signal, while maintaining a high temporal resolution of 0.125 BPM. The visual integration relies on Full Scene Tracking, allowing it to be used for live beat detection with practically all musicians and instruments. Several optimization techniques, including pyramidal optimization (PO) and clustering, are presented in this thesis. A Temporal Difference (TD) learning approach to sensor fusion and beat synchronization is also proposed and tested thoroughly. The TD learning algorithm implements a novel policy-switching criterion that provides a stable yet quickly reacting tempo estimate. The proposed algorithm has been implemented and tested on a robotic drummer to verify the validity of the approach. The test results are documented in detail and compared with previously proposed approaches. (en_US)
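The abstract describes fusing audio and visual tempo detections with a TD-style update. The thesis's actual policy-switching criterion, fusion weights, and learning rate are not given in the record, so the sketch below is an assumption-laden illustration only: `alpha` and the fixed two-source weighting `w_audio` are invented for this example, not taken from the thesis.

```python
def td_tempo_update(estimate, audio_bpm, visual_bpm, alpha=0.1, w_audio=0.7):
    """One TD-style update of a running tempo estimate (illustrative only).

    The observed tempo is a weighted fusion of the audio and visual
    detections; the running estimate moves toward it at rate alpha,
    trading stability (small alpha) against reaction speed (large alpha).
    """
    observed = w_audio * audio_bpm + (1.0 - w_audio) * visual_bpm
    return estimate + alpha * (observed - estimate)

# The estimate converges toward the fused observation over repeated updates.
est = 100.0
for _ in range(50):
    est = td_tempo_update(est, audio_bpm=120.0, visual_bpm=118.0)
```

A policy-switching criterion such as the one the abstract mentions would, presumably, adjust how much weight the update places on new observations; the fixed `alpha` here stands in for that mechanism.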
dc.identifier.uri: http://hdl.handle.net/1903/13077
dc.subject.pqcontrolled: Robotics (en_US)
dc.subject.pqcontrolled: Music (en_US)
dc.subject.pquncontrolled: Beat (en_US)
dc.subject.pquncontrolled: Detection (en_US)
dc.subject.pquncontrolled: Musician (en_US)
dc.subject.pquncontrolled: Robotic (en_US)
dc.subject.pquncontrolled: Tempo (en_US)
dc.subject.pquncontrolled: Visual (en_US)
dc.title: AVISARME: Audio Visual Synchronization Algorithm for a Robotic Musician Ensemble (en_US)
dc.type: Thesis (en_US)

Files

Original bundle
Name: Berman_umd_0117N_13515.pdf
Size: 1.86 MB
Format: Adobe Portable Document Format