Vis enkel innførsel

dc.contributor.authorBakken, Rune Havnungnb_NO
dc.date.accessioned2014-12-19T13:41:06Z
dc.date.available2014-12-19T13:41:06Z
dc.date.created2014-01-03nb_NO
dc.date.issued2013nb_NO
dc.identifier683341nb_NO
dc.identifier.isbn978-82-471-4406-0 (printed ver.)nb_NO
dc.identifier.isbn978-82-471-4407-7 (electronic ver.)nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/253611
dc.description.abstractCapturing human motion is a key problem in computer vision, because of the wide range of applications that can benefit from the acquired data. Motion capture is used to identify people by their gait, for interacting with computers using gestures, for improving the performance of athletes, for diagnosis of orthopaedic patients, and for creating virtual characters with more natural looking motions in movies and games. These are but a few of the possible applications of human motion capture. In some of the application areas mentioned above it is important that the data acquisition is unconstrained by the markers or wearable sensors traditionally used in commercial motion capture systems. Furthermore, there is a need for low latency and real-time performance for certain applications, for instance in perceptive user interfaces and gait recognition. Human pose estimation is defined as the process of estimating the configuration of the underlying skeletal structure of the human body. In this dissertation several algorithms that together form a real-time pose estimation pipeline are proposed. Images captured with a calibrated multi-camera system are input to the pipeline, and the 3D positions of 25 joints in the global coordinate frame are the resulting output. The steps of the pipeline are: a) subtract the background from the images to create silhouettes; b) reconstruct the volume occupied by the performer from the silhouette images; c) extract skeleton curves from the volume; d) identify extremities and segment the skeletal curves into body parts; and e) fit a model to the labelled data. The pipeline can initialise automatically, and can recover from errors in estimation. There are four main contributions of the research effort presented in this dissertation: a) a toolset for evaluating shape-from-silhouette-based pose estimation algorithms using synthetic motion sequences generated from real motion capture data; b) a fully parallel thinning algorithm implemented on the Graphics Processing Unit (GPU) that can skeletonise voxel volumes in real time; c) a real-time pose estimation algorithm that builds a tree structure segmented into body parts from skeleton data; and d) a constraint algorithm that can fit an articulated model to a labelled tree structure in real time.nb_NO
dc.languageengnb_NO
dc.publisherNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.relation.ispartofseriesDoktoravhandlinger ved NTNU, 1503-8181; 2013:150nb_NO
dc.titleImproving real-time human pose estimation from multi-view videonb_NO
dc.typeDoctoral thesisnb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.description.degreePhD i informasjonsteknologinb_NO
dc.description.degreePhD in Information Technologyen_GB


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel