Publications

Perceiving Systems Thesis Dynamic 3D Synthesis: From Video-Based Animatable Head Avatars to Text-Guided 4D Content Creation Zheng, Y. 2025 (Published)

Abstract ›

The synthesis of 4D content—dynamic 3D content that evolves over time—has become increasingly important across a wide range of applications, including virtual communication, gaming, AR/VR, and digital content creation. Despite recent advances, generating realistic 4D content from accessible inputs remains a significant challenge. Existing approaches often rely on dense multi-camera capture systems, which are costly and impractical for everyday use, or yield results with limited geometric and visual fidelity. This thesis investigates two sub tasks in 4D content creation: (1) the reconstruction of high-fidelity, animatable head avatars from accessible inputs such as monocular RGB videos, and (2) the generation of dynamic 4D scenes from text prompts and optionally sparse visual input, such as reference images. These two directions are unified by a common goal—enabling controllable and high-quality 4D content creation from minimal visual supervision. The first part of this thesis presents IMavatar, a morphable implicit surface representation for reconstructing personalized head avatars from monocular videos. Implicit surfaces provide topological flexibility and can recover detailed 3D geometry directly from RGB images, making them well-suited for head avatar reconstruction. However, modeling expression- and pose-dependent deformations in an interpretable and generalizable way remains a major challenge when working with implicit representations. Inspired by 3D morphable models, IMavatar models deformation by learning expression blendshapes and skinning weight fields in a canonical space, enabling structured and generalizable control over novel expressions and poses. To enable end-to-end optimization from monocular videos, we propose a novel analytical gradient formulation that supports joint training of the geometry and deformation directly from RGB supervision. By combining the geometric fidelity of neural implicit fields with the controllability of morphable models, IMavatar achieves high-quality 4D reconstructions and strong generalization to unseen expressions and head poses. The second part of this thesis presents PointAvatar, a deformable point-based representation for animatable 3D head avatars. While implicit representations are effective at learning detailed geometry from image observations, they are inherently difficult to animate and computationally expensive to render. To address these limitations, this work explores point clouds as the underlying geometric representation for head avatars, offering the efficiency of explicit representations while avoiding the fixed-topology constraints of meshes. PointAvatar uses a canonical point cloud combined with learned blendshape and skinning weight fields, and further disentangles intrinsic albedo from view-dependent shading to support relighting under novel illumination. To improve training stability and reconstruction quality, we adopt a coarse-to-fine strategy that gradually increases point cloud resolution during learning. This enables the model to effectively capture accurate geometry and high-quality texture from monocular RGB videos, including challenging cases such as eyeglasses and complex hairstyles. Compared to IMavatar, PointAvatar achieves an 8× speed-up during training and a 100× speed-up during inference rendering, while maintaining high visual and geometric quality. In the final part, this thesis explores Dream-in-4D, a diffusion-guided framework for generating creative 4D content from natural language. The focus is on synthesizing imaginative 4D scenes from minimal visual input—either a single image or no visual input at all. To this end, the method leverages prior knowledge from pre-trained image and video diffusion models to optimize a 4D representation. Dream-in-4D follows a two-stage pipeline. In the first stage, a static 3D model is optimized as a neural radiance field using guidance from both image and 3D-aware diffusion models, resulting in high-quality, view-consistent assets. In the second stage, a time-dependent, multi-resolution deformation field is introduced to represent motion and is optimized using video diffusion guidance, equipping the static 3D asset with detailed and plausible motion driven by text prompts. The resulting system supports text-to-4D, image-to-4D, and personalized 4D generation within a unified framework, enabling intuitive and flexible dynamic scene synthesis from highly accessible inputs. Together, these methods address two essential aspects of 4D content creation: the reconstruction of animatable head avatars from monocular videos, and the generation of dynamic, imaginative 4D scenes from text and image prompts. We hope these contributions advance the field toward more accessible, controllable, and high-quality 4D content creation—enabling a broad range of applications across research, industry, and creative practice.

DOI URL BibTeX

Haptic Intelligence Bachelor Thesis Kalman Filter Approach to Sensor Fusion of Ultra-Wideband Positioning and IMU Readings for Enhanced Indoor Tracking of Collaborating Humans Hudhud Mughrabi, M. Kadir Has University, Istanbul, Turkey, June 2024, Bachelor of Science (BSc) in Mechatronics Engineering (Published)

Abstract ›

The question of how humans collaborate to perform complex tasks such as surgery has previously been investigated via multimodal sensing and analysis. Ultra-wideband (UWB) localization systems can be deployed to track collaborating team members due to good maneuverability even in cramped environments. However, UWB systems' sampling rate is inversely proportional to the number of people tracked, and their accuracy is hindered by electromagnetic occlusion. This thesis combines UWB positioning with measurements from a wearable inertial measurement unit (IMU) by applying an error-state extended Kalman filter (ES-EKF) to improve position and orientation estimation during team collaborative studies. ES-EKF offers faster and more consistent estimation and can be estimated even without UWB input. Single-human and multi-human sessions were recorded and filtered for evaluation in comparison to ground truth from optical motion capture. By integrating the IMU, the ES-EKF increases the sampling rate from 0.5–20 Hz to 100 Hz. As it is corrected in only 2 degrees of freedom (DOF), the ES-EKF yields improved results over UWB in 4 out of 6 DOF: lateral and longitudinal position and yaw and pitch orientation. Further filter design implications are suggested for future application of ES-EKF in position and orientation estimation of collaborating humans.

BibTeX

Empirical Inference Bachelor Thesis Navigating the Ocean of Biases: Political Bias Attribution in Language Models via Causal Structures Jenny, D. ETH Zurich, Switzerland, November 2023, external supervision (Published) BibTeX

Modern Magnetic Systems Thesis Voltage dependent interfacial magnetism in multilayer systems Nacke, R. Universität Stuttgart, Stuttgart, December 2020 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Nichtgleichgewichtsdynamik einer abgekühlten kritischen Flüssigkeit mit Oberflächenfeldern unterschiedlichen Vorzeichens Dertli, D. Universität Stuttgart, Stuttgart, January 2020 BibTeX

Empirical Inference Bachelor Thesis Automatic Segmentation and Labelling for Robot Table Tennis Time Series Lutz, P. Technical University Darmstadt, Germany, August 2019 (Published) BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Controlling pattern formation in the confined Schnakenberg model Beyer, D. B. Universität Stuttgart, Stuttgart, 2019 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Fluctuating interface with a pinning potential Pranjić, D. Universität Stuttgart, Stuttgart, 2019 BibTeX

Micro, Nano, and Molecular Systems Bachelor Thesis HPLC separation of ligand-exchanged gold clusters with atomic precision Itzigehl, S. Univ. of Stuttgart, 2019 BibTeX

Micro, Nano, and Molecular Systems Bachelor Thesis DNA-linked gold nanoclusters Hornberger, L. Univ. of Stuttgart, 2018 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Electrostatic interaction between colloids with constant surface potentials at fluid interfaces Bebon, R. Universität Stuttgart, Stuttgart, 2018 BibTeX

Micro, Nano, and Molecular Systems Bachelor Thesis HPLC-Trennung von Gold-clustern Vogt, P. Univ. of Stuttgart, 2018 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Monte Carlo study of colloidal structure formation at fluid interfaces Meiler, T. Universität Stuttgart, Stuttgart, 2018 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Non-equilibrium dynamics of a binary solvent around heated colloidal particles Wilke, M. Universität Stuttgart, Stuttgart, 2018 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Pattern forming systems under confinement Maihöfer, M. Universität Stuttgart, Stuttgart, 2018 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Surface structure of liquid crystals Sattler, A. Universität Stuttgart, Stuttgart, 2018 BibTeX

Dynamic Locomotion Bachelor Thesis Untersuchung und Charakterisierung von Teilelementen der Modifikation im Lumbosacralbereich von Vögeln Richter, J. Hochschule Harz, 2018 BibTeX

Software Workshop Bachelor Thesis Design of a visualization scheme for functional connectivity data of Human Brain Bramlage, L. Hochschule Osnabrück - University of Applied Sciences, 2017 Bramlage_BSc_2017.pdf BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Electrostatic interaction between non-identical charged particles at an electrolyte interface Schmetzer, T. Universität Stuttgart, Stuttgart, 2017 BibTeX

Micro, Nano, and Molecular Systems Bachelor Thesis Enzyme activity and transport in biological media Troll, J. Univ. of Stuttgart, 2017 BibTeX

Theory of Inhomogeneous Condensed Matter Bachelor Thesis Non-equilibrium forces after temperature quenches in ideal fluids with conserved density Hölzl, C. Universität Stuttgart, Stuttgart, 2017 BibTeX

Micro, Nano, and Molecular Systems Bachelor Thesis Propulsion of magnetic colloids at low Reynolds number Segreto, N. Univ. of Stuttgart, 2017 BibTeX

Autonomous Motion Intelligent Control Systems Thesis Policy Search for Imitation Learning Doerr, A. University of Stuttgart, January 2015 URL BibTeX

Empirical Inference Thesis Development of advanced methods for improving astronomical images Schmeißer, N. Eberhard Karls Universität Tübingen, Germany, Eberhard Karls Universität Tübingen, Germany, 2014 BibTeX

Empirical Inference Probabilistic Numerics Thesis Camera-specific Image Denoising Schober, M. Eberhard Karls Universität Tübingen, Germany, October 2013 (Published) PDF BibTeX

Empirical Inference Thesis Automatische Seitenkettenzuordnung zur NMR Proteinstrukturaufklärung mittels ganzzahliger linearer Programmierung Hooge, J. University of Tübingen, Germany, 2012 BibTeX

Empirical Inference Probabilistic Numerics Thesis Nonparametric System Identification and Control for Periodic Error Correction in Telescopes Klenske, E. D. University of Stuttgart, 2012 (Published) PDF BibTeX

Empirical Inference Thesis Inferring High-Dimensional Causal Relations using Free Probability Theory Zscheischler, J. Humboldt Universität Berlin, Germany, August 2010 PDF BibTeX

Empirical Inference Thesis Semi-supervised Subspace Learning and Application to Human Functional Magnetic Brain Resonance Imaging Data Shelton, J. Biologische Kybernetik, Eberhard Karls Universität, Tübingen, Germany, July 2010 PDF BibTeX

Empirical Inference Thesis Quantitative Evaluation of MR-based Attenuation Correction for Positron Emission Tomography (PET) Mantlik, F. Biologische Kybernetik, Universität Mannheim, Germany, March 2010 BibTeX

Empirical Inference Thesis Detecting the mincut in sparse random graphs Köhler, R. Eberhard Karls Universität Tübingen, Germany, 2010 BibTeX

Empirical Inference Thesis Finding Gene-Gene Interactions using Support Vector Machines Rakitsch, B. Eberhard Karls Universität Tübingen, Germany, 2010 BibTeX

Empirical Inference Thesis Hierarchical Clustering and Density Estimation Based on k-nearest-neighbor graphs Drewe, P. Eberhard Karls Universität Tübingen, Germany, 2009 BibTeX

Empirical Inference Thesis Motor Control and Learning in Table Tennis Mülling, K. Eberhard Karls Universität Tübingen, Gerrmany, 2009 BibTeX

Empirical Inference Thesis Reinforcement Learning for Motor Primitives Kober, J. Biologische Kybernetik, University of Stuttgart, Stuttgart, Germany, August 2008 PDF BibTeX

Empirical Inference Thesis Asymmetries of Time Series under Inverting their Direction Peters, J. Biologische Kybernetik, University of Heidelberg, August 2008 PDF BibTeX

Empirical Inference Thesis Pairwise Correlations and Multineuronal Firing Patterns in Primary Visual Cortex Berens, P. Biologische Kybernetik, Eberhard Karls Universität Tübingen, Tübingen, Germany, April 2008 BibTeX

Empirical Inference Thesis Development and Application of a Python Scripting Framework for BCI2000 Schreiner, T. Biologische Kybernetik, Eberhard-Karls-Universität Tübingen, Tübingen, Germany, January 2008 BibTeX

Empirical Inference Thesis Statistical Learning Theory Approaches to Clustering Jegelka, S. Biologische Kybernetik, Eberhard-Karls-Universität Tübingen, Tübingen, Germany, November 2007 PDF BibTeX

Empirical Inference Thesis Error Correcting Codes for the P300 Visual Speller Biessmann, F. Biologische Kybernetik, Eberhard-Karls-Universität Tübingen, Tübingen, Germany, July 2007

Abstract ›

The aim of brain-computer interface (BCI) research is to establish a communication system based on intentional modulation of brain activity. This is accomplished by classifying patterns of brain ac- tivity, volitionally induced by the user. The BCI presented in this study is based on a classical paradigm as proposed by (Farwell and Donchin, 1988), the P300 visual speller. Recording electroencephalo- grams (EEG) from the scalp while presenting letters successively to the user, the speller can infer from the brain signal which letter the user was focussing on. Since EEG recordings are noisy, usually many repetitions are needed to detect the correct letter. The focus of this study was to improve the accuracy of the visual speller applying some basic principles from information theory: Stimulus sequences of the speller have been modi&amp;amp;amp;#64257;ed into error-correcting codes. Additionally a language model was incorporated into the probabilistic letter de- coder. Classi&amp;amp;amp;#64257;cation of single EEG epochs was less accurate using error correcting codes. However, the novel code could compensate for that such that overall, letter accuracies were as high as or even higher than for classical stimulus codes. In particular at high noise levels, error-correcting decoding achieved higher letter accuracies.

PDF BibTeX

Empirical Inference Thesis A priori Knowledge from Non-Examples Sinz, F. Biologische Kybernetik, Eberhard-Karls-Universität Tübingen, Tübingen, Germany, March 2007 PDF Web BibTeX

Empirical Inference Thesis Development of a Brain-Computer Interface Approach Based on Covert Attention to Tactile Stimuli Raths, C. University of Tübingen, Germany, University of Tübingen, Germany, January 2007 BibTeX

Empirical Inference Thesis A Machine Learning Approach for Estimating the Attenuation Map for a Combined PET/MR Scanner Hofmann, M. Biologische Kybernetik, Max-Planck Institute for Biological Cybernetics, Tübingen, Germany, 2007 BibTeX

Empirical Inference Thesis Kernel PCA for Image Compression Huhle, B. Biologische Kybernetik, Eberhard-Karls-Universität, Tübingen, Germany, April 2006 PDF BibTeX

Empirical Inference Thesis Implicit Surfaces For Modelling Human Heads Steinke, F. Biologische Kybernetik, Eberhard-Karls-Universität, Tübingen, September 2005 BibTeX

Empirical Inference Thesis Efficient Adaptive Sampling of the Psychometric Function by Maximizing Information Gain Tanner, T. Biologische Kybernetik, Eberhard-Karls University Tübingen, Tübingen, Germany, May 2005

Abstract ›

A common task in psychophysics is to measure the psychometric function. A psychometric function can be described by its shape and four parameters: offset or threshold, slope or width, false alarm rate or chance level and miss or lapse rate. Depending on the parameters of interest some points on the psychometric function may be more informative than others. Adaptive methods attempt to place trials on the most informative points based on the data collected in previous trials. A new Bayesian adaptive psychometric method placing trials by minimising the expected entropy of the posterior probabilty dis- tribution over a set of possible stimuli is introduced. The method is more flexible, faster and at least as efficient as the established method (Kontsevich and Tyler, 1999). Comparably accurate (2dB) threshold and slope estimates can be obtained after about 30 and 500 trials, respectively. By using a dynamic termination criterion the efficiency can be further improved. The method can be applied to all experimental designs including yes/no designs and allows acquisition of any set of free parameters. By weighting the importance of parameters one can include nuisance parameters and adjust the relative expected errors. Use of nuisance parameters may lead to more accurate estimates than assuming a guessed fixed value. Block designs are supported and do not harm the performance if a sufficient number of trials are performed. The method was evaluated by computer simulations in which the role of parametric assumptions, its robustness, the quality of different point estimates, the effect of dynamic termination criteria and many other settings were investigated.

BibTeX

Empirical Inference Thesis Real-Time Face Detection Kienzle, W. Biologische Kybernetik, Eberhard-Karls-Universitaet Tuebingen, Tuebingen, Germany, October 2003 BibTeX

Empirical Inference Thesis m-Alternative Forced Choice—Improving the Efficiency of the Method of Constant Stimuli Jäkel, F. Biologische Kybernetik, Graduate School for Neural and Behavioural Sciences, Tübingen, 2003 BibTeX

Empirical Inference Thesis Variationsverfahren zur Untersuchung von Grundzustandseigenschaften des Ein-Band Hubbard-Modells Eichhorn, J. Biologische Kybernetik, Technische Universität Dresden, Dresden/Germany, May 2001

Abstract ›

Using different modifications of a new variational approach, statical groundstate properties of the one-band Hubbard model such as energy and staggered magnetisation are calculated. By taking into account additional fluctuations, the method ist gradually improved so that a very good description of the energy in one and two dimensions can be achieved. After a detailed discussion of the application in one dimension, extensions for two dimensions are introduced. By use of a modified version of the variational ansatz in particular a description of the quantum phase transition for the magnetisation should be possible.

PostScript BibTeX

Thesis Change-point Detection and Kernels Methods 0 BibTeX

Research

Departments

Max Planck Research Groups

Start-Up Teams

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives

Research

Departments

Max Planck Research Groups

Start-Up Teams

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives

Publications

Filter by