Loading [MathJax]/extensions/MathMenu.js
Automatic Camera Selection, Shot Size, and Video Editing in Theater Multi-Camera Recordings | IEEE Journals & Magazine | IEEE Xplore

Automatic Camera Selection, Shot Size, and Video Editing in Theater Multi-Camera Recordings


Process flow of the Proposed Recording System (PRS).

Abstract:

In a non-professional environment, multi-camera recordings of theater performances or other stage shows are difficult to realize, because amateurs are usually untrained i...Show More

Abstract:

In a non-professional environment, multi-camera recordings of theater performances or other stage shows are difficult to realize, because amateurs are usually untrained in camera work and in using a vision mixing desk that mixes multiple cameras. This can be remedied by a production process with high-resolution cameras where recordings of image sections from long shots or medium-long shots are manually or automatically cropped in post-production. For this purpose, Gandhi et al. presented a single-camera system (referred to as Gandhi Recording System in the paper) that obtains close-ups from a high-resolution recording from the central perspective. The proposed system in this paper referred to as “Proposed Recording System” extends the method to four perspectives based on a Reference Recording System from professional TV theater recordings from the Ohnsorg Theater. Rules for camera selection, image cropping, and montage are derived from the Reference Recording System in this paper. For this purpose, body and pose recognition software is used and the stage action is reconstructed from the recordings into the stage set. Speakers are recognized by detecting lip movements and speaker changes are identified using audio diarization software. The Proposed Recording System proposed in this paper is practically instantiated on a school theater recording made by laymen using four 4K cameras. An automatic editing script is generated that outputs a montage of a scene. The principles can also be adapted for other recording situations with an audience, such as lectures, interviews, discussions, talk shows, gala events, award ceremonies, and the like. More than 70 % of test persons confirm in an online study the added value of the perspective diversity of four cameras of the Proposed Recording System versus the single-camera method of Gandhi et al.
Process flow of the Proposed Recording System (PRS).
Published in: IEEE Access ( Volume: 11)
Page(s): 96673 - 96692
Date of Publication: 01 September 2023
Electronic ISSN: 2169-3536

Funding Agency:

Author image of Eckhard Stoll
Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, Germany
Audio Visual Media Center, South Westphalia University of Applied Sciences, Meschede, Germany
Eckhard Stoll received the degree in electrical engineering from the University of Karlsruhe. He carried out his first “quasi”-multi-camera productions in the 1980s. He is currently the Artistic Director of the Audio Visual Media Center, University of Applied Sciences Südwestfalen, teaches in the field of media production. He is also the Production Manager of Multi-Camera Productions. In cooperation with TU Ilmenau, he re...Show More
Eckhard Stoll received the degree in electrical engineering from the University of Karlsruhe. He carried out his first “quasi”-multi-camera productions in the 1980s. He is currently the Artistic Director of the Audio Visual Media Center, University of Applied Sciences Südwestfalen, teaches in the field of media production. He is also the Production Manager of Multi-Camera Productions. In cooperation with TU Ilmenau, he re...View more
Author image of Stephan Breide
Audio Visual Media Center, South Westphalia University of Applied Sciences, Meschede, Germany
Stephan Breide received the Ph.D. (Dr.-Ing.) degree in electrical engineering with a focus on communications engineering, communications systems engineering, and television engineering from the Technical University of Braunschweig. He teaches as a Full Professor with the South Westphalia University of Applied Sciences in the field of communication services and applications. He is currently the Head of the Audio Visual Med...Show More
Stephan Breide received the Ph.D. (Dr.-Ing.) degree in electrical engineering with a focus on communications engineering, communications systems engineering, and television engineering from the Technical University of Braunschweig. He teaches as a Full Professor with the South Westphalia University of Applied Sciences in the field of communication services and applications. He is currently the Head of the Audio Visual Med...View more
Author image of Steve Göring
Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, Germany
Steve Göring received the B.Sc. and M.Sc. degrees in computer science from TU Ilmenau and the Ph.D. degree in visual quality prediction using machine learning, in 2022. His focus is also on data analysis problems for video quality models and video streams. In 2016, he was with the Audiovisual Technology Group. He was with the Big Data Analytics Group, Bauhaus University Weimar. He is currently working as a Computer Scient...Show More
Steve Göring received the B.Sc. and M.Sc. degrees in computer science from TU Ilmenau and the Ph.D. degree in visual quality prediction using machine learning, in 2022. His focus is also on data analysis problems for video quality models and video streams. In 2016, he was with the Audiovisual Technology Group. He was with the Big Data Analytics Group, Bauhaus University Weimar. He is currently working as a Computer Scient...View more
Author image of Alexander Raake
Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, Germany
Alexander Raake (Member, IEEE) received the Ph.D. (Dr.-Ing.) degree from the Electrical Engineering and Information Technology Faculty, Ruhr-Universitat Bochum, in 2005, with the book Speech Quality of VoIP. He has joined TU Ilmenau, in 2015, as a Full Professor, where he heads the Audiovisual Technology Group. From 2005 to 2015, he held a Senior Researcher, an Assistant Professor, and a later Associate Professor position...Show More
Alexander Raake (Member, IEEE) received the Ph.D. (Dr.-Ing.) degree from the Electrical Engineering and Information Technology Faculty, Ruhr-Universitat Bochum, in 2005, with the book Speech Quality of VoIP. He has joined TU Ilmenau, in 2015, as a Full Professor, where he heads the Audiovisual Technology Group. From 2005 to 2015, he held a Senior Researcher, an Assistant Professor, and a later Associate Professor position...View more

Author image of Eckhard Stoll
Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, Germany
Audio Visual Media Center, South Westphalia University of Applied Sciences, Meschede, Germany
Eckhard Stoll received the degree in electrical engineering from the University of Karlsruhe. He carried out his first “quasi”-multi-camera productions in the 1980s. He is currently the Artistic Director of the Audio Visual Media Center, University of Applied Sciences Südwestfalen, teaches in the field of media production. He is also the Production Manager of Multi-Camera Productions. In cooperation with TU Ilmenau, he researches in camera-based production technology. With one camera, four different performances of a play were recorded from four different camera positions, resulting in a multi-camera edit in post-production.
Eckhard Stoll received the degree in electrical engineering from the University of Karlsruhe. He carried out his first “quasi”-multi-camera productions in the 1980s. He is currently the Artistic Director of the Audio Visual Media Center, University of Applied Sciences Südwestfalen, teaches in the field of media production. He is also the Production Manager of Multi-Camera Productions. In cooperation with TU Ilmenau, he researches in camera-based production technology. With one camera, four different performances of a play were recorded from four different camera positions, resulting in a multi-camera edit in post-production.View more
Author image of Stephan Breide
Audio Visual Media Center, South Westphalia University of Applied Sciences, Meschede, Germany
Stephan Breide received the Ph.D. (Dr.-Ing.) degree in electrical engineering with a focus on communications engineering, communications systems engineering, and television engineering from the Technical University of Braunschweig. He teaches as a Full Professor with the South Westphalia University of Applied Sciences in the field of communication services and applications. He is currently the Head of the Audio Visual Media Center. His focus is on multimedia applications and digital communication networks and the improvement of internet coverage in fixed wired broadband.
Stephan Breide received the Ph.D. (Dr.-Ing.) degree in electrical engineering with a focus on communications engineering, communications systems engineering, and television engineering from the Technical University of Braunschweig. He teaches as a Full Professor with the South Westphalia University of Applied Sciences in the field of communication services and applications. He is currently the Head of the Audio Visual Media Center. His focus is on multimedia applications and digital communication networks and the improvement of internet coverage in fixed wired broadband.View more
Author image of Steve Göring
Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, Germany
Steve Göring received the B.Sc. and M.Sc. degrees in computer science from TU Ilmenau and the Ph.D. degree in visual quality prediction using machine learning, in 2022. His focus is also on data analysis problems for video quality models and video streams. In 2016, he was with the Audiovisual Technology Group. He was with the Big Data Analytics Group, Bauhaus University Weimar. He is currently working as a Computer Scientist with the Audiovisual Technology Group, TU Ilmenau. His specializations are data analytics/machine learning, video quality, and distributed communication/information systems.
Steve Göring received the B.Sc. and M.Sc. degrees in computer science from TU Ilmenau and the Ph.D. degree in visual quality prediction using machine learning, in 2022. His focus is also on data analysis problems for video quality models and video streams. In 2016, he was with the Audiovisual Technology Group. He was with the Big Data Analytics Group, Bauhaus University Weimar. He is currently working as a Computer Scientist with the Audiovisual Technology Group, TU Ilmenau. His specializations are data analytics/machine learning, video quality, and distributed communication/information systems.View more
Author image of Alexander Raake
Audiovisual Technology Group, Technische Universität Ilmenau, Ilmenau, Germany
Alexander Raake (Member, IEEE) received the Ph.D. (Dr.-Ing.) degree from the Electrical Engineering and Information Technology Faculty, Ruhr-Universitat Bochum, in 2005, with the book Speech Quality of VoIP. He has joined TU Ilmenau, in 2015, as a Full Professor, where he heads the Audiovisual Technology Group. From 2005 to 2015, he held a Senior Researcher, an Assistant Professor, and a later Associate Professor positions with the An-Institut T-Laboratories, TU Berlin, a joint venture between Deutsche Telekom AG and TU Berlin, heading the Assessment of IP-Based Applications Group. From 2004 to 2005, he was a Postdoctoral Researcher with LIMSI-CNRS, Orsay, France. His research interests include audiovisual and multimedia technology, speech, audio, and video signals, human audiovisual perception, and quality of experience. Since 1999, he has been involved with the ITU-T Study Group 12’s Standardization work on QoS and QoE assessment methods. He is a member of the Acoustical Society of America, the AES, VDE/ITG, and DEGA.
Alexander Raake (Member, IEEE) received the Ph.D. (Dr.-Ing.) degree from the Electrical Engineering and Information Technology Faculty, Ruhr-Universitat Bochum, in 2005, with the book Speech Quality of VoIP. He has joined TU Ilmenau, in 2015, as a Full Professor, where he heads the Audiovisual Technology Group. From 2005 to 2015, he held a Senior Researcher, an Assistant Professor, and a later Associate Professor positions with the An-Institut T-Laboratories, TU Berlin, a joint venture between Deutsche Telekom AG and TU Berlin, heading the Assessment of IP-Based Applications Group. From 2004 to 2005, he was a Postdoctoral Researcher with LIMSI-CNRS, Orsay, France. His research interests include audiovisual and multimedia technology, speech, audio, and video signals, human audiovisual perception, and quality of experience. Since 1999, he has been involved with the ITU-T Study Group 12’s Standardization work on QoS and QoE assessment methods. He is a member of the Acoustical Society of America, the AES, VDE/ITG, and DEGA.View more

References

References is not available for this document.