2016


Consistent Kernel Mean Estimation for Functions of Random Variables

Simon-Gabriel*, C. J., Ścibior*, A., Tolstikhin, I., Schölkopf, B.

Advances in Neural Information Processing Systems 29, pages: 1732-1740, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016, *joint first authors (conference)

ei

link (url) Project Page Project Page Project Page [BibTex]

Understanding Probabilistic Sparse Gaussian Process Approximations

Bauer, M., van der Wilk, M., Rasmussen, C. E.

Advances in Neural Information Processing Systems 29, pages: 1533-1541, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (conference)

ei

link (url) Project Page [BibTex]

Minimax Estimation of Maximum Mean Discrepancy with Radial Kernels

Tolstikhin, I., Sriperumbudur, B. K., Schölkopf, B.

Advances in Neural Information Processing Systems 29, pages: 1930-1938, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (conference)

ei

link (url) Project Page [BibTex]

Local-utopia Policy Selection for Multi-objective Reinforcement Learning

Parisi, S., Blank, A., Viernickel, T., Peters, J.

In IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), pages: 1-7, IEEE, December 2016 (inproceedings)

ei

DOI [BibTex]

Lifelong Learning with Weighted Majority Votes

Pentina, A., Urner, R.

Advances in Neural Information Processing Systems 29, pages: 3612-3620, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (conference)

ei

link (url) Project Page [BibTex]

Active Nearest-Neighbor Learning in Metric Spaces

Kontorovich, A., Sabato, S., Urner, R.

Advances in Neural Information Processing Systems 29, pages: 856-864, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (conference)

ei

link (url) Project Page [BibTex]

Predictive and Self Triggering for Event-based State Estimation

Trimpe, S.

In Proceedings of the 55th IEEE Conference on Decision and Control (CDC), pages: 3098-3105, Las Vegas, NV, USA, December 2016 (inproceedings)

am ics

arXiv PDF DOI Project Page [BibTex]

Catching heuristics are optimal control policies

Belousov, B., Neumann, G., Rothkopf, C., Peters, J.

Advances in Neural Information Processing Systems 29, pages: 1426-1434, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (conference)

ei

link (url) Project Page [BibTex]

Incremental Imitation Learning of Context-Dependent Motor Skills

Ewerton, M., Maeda, G., Kollegger, G., Wiemeyer, J., Peters, J.

IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids), pages: 351-358, IEEE, November 2016 (conference)

ei

DOI Project Page [BibTex]

Using Probabilistic Movement Primitives for Striking Movements

Gomez-Gonzalez, S., Neumann, G., Schölkopf, B., Peters, J.

16th IEEE-RAS International Conference on Humanoid Robots (Humanoids), pages: 502-508, November 2016 (conference)

am ei

link (url) DOI Project Page [BibTex]

Demonstration Based Trajectory Optimization for Generalizable Robot Motions

Koert, D., Maeda, G., Lioutikov, R., Neumann, G., Peters, J.

IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids), pages: 351-358, IEEE, November 2016 (conference)

ei

DOI Project Page [BibTex]

Jointly Learning Trajectory Generation and Hitting Point Prediction in Robot Table Tennis

Huang, Y., Büchler, D., Koc, O., Schölkopf, B., Peters, J.

16th IEEE-RAS International Conference on Humanoid Robots (Humanoids), pages: 650-655, November 2016 (conference)

am ei

final link (url) DOI Project Page [BibTex]

Deep Spiking Networks for Model-based Planning in Humanoids

Tanneberg, D., Paraschos, A., Peters, J., Rueckert, E.

IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids), pages: 656-661, IEEE, November 2016 (conference)

ei

DOI Project Page [BibTex]

Anticipative Interaction Primitives for Human-Robot Collaboration

Maeda, G., Maloo, A., Ewerton, M., Lioutikov, R., Peters, J.

AAAI Fall Symposium Series. Shared Autonomy in Research and Practice, pages: 325-330, November 2016 (conference)

ei

link (url) [BibTex]

The Role of Measurement Uncertainty in Optimal Control for Contact Interactions

Workshop on the Algorithmic Foundations of Robotics, pages: 22, November 2016 (conference)

Abstract
Stochastic Optimal Control (SOC) typically considers noise only in the process model, i.e. unknown disturbances. However, in many robotic applications that involve interaction with the environment, such as locomotion and manipulation, uncertainty also comes from lack of precise knowledge of the world, which is not an actual disturbance. We develop a computationally efficient SOC algorithm, based on risk-sensitive control, that takes into account uncertainty in the measurements. We include the dynamics of an observer in such a way that the control law explicitly depends on the current measurement uncertainty. We show that high measurement uncertainty leads to low impedance behaviors, a result in contrast with the effects of process noise variance that creates stiff behaviors. Simulation results on a simple 2D manipulator show that our controller can create better interaction with the environment under uncertain contact locations than traditional SOC approaches.

am

arXiv [BibTex]

Unifying distillation and privileged information

Lopez-Paz, D., Schölkopf, B., Bottou, L., Vapnik, V.

International Conference on Learning Representations (ICLR), November 2016 (conference)

ei

Arxiv Project Page [BibTex]

Learning High-Order Filters for Efficient Blind Deconvolution of Document Photographs

Xiao, L., Wang, J., Heidrich, W., Hirsch, M.

Computer Vision - ECCV 2016, Lecture Notes in Computer Science, LNCS 9907, Part III, pages: 734-749, (Editors: Bastian Leibe, Jiri Matas, Nicu Sebe and Max Welling), Springer, October 2016 (conference)

ei

DOI [BibTex]

Adaptive Training Strategies for BCIs

Sharma, D., Tanneberg, D., Grosse-Wentrup, M., Peters, J., Rueckert, E.

Cybathlon Symposium, October 2016 (conference)

ei

link (url) [BibTex]

Learning Where to Search Using Visual Attention

Kloss, A., Kappler, D., Lensch, H. P. A., Butz, M. V., Schaal, S., Bohg, J.

Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems, IEEE, IROS, October 2016 (conference)

Abstract
One of the central tasks for a household robot is searching for specific objects. This requires not only localizing the target object but also identifying promising search locations in the scene if the target is not immediately visible. As computation time and hardware resources are usually limited in robotics, it is desirable to avoid expensive visual processing steps that are exhaustively applied over the entire image. The human visual system can quickly select those image locations that have to be processed in detail for a given task. This allows us to cope with huge amounts of information and to efficiently deploy the limited capacities of our visual system. In this paper, we therefore propose to use human fixation data to train a top-down saliency model that predicts relevant image locations when searching for specific objects. We show that the learned model can successfully prune bounding box proposals without rejecting the ground truth object locations. In this respect, the proposed model outperforms a model that is trained only on the ground truth segmentations of the target object instead of fixation data.

am

PDF Project Page [BibTex]

Parameter Learning for Improving Binary Descriptor Matching

Sankaran, B., Ramalingam, S., Taguchi, Y.

In International Conference on Intelligent Robots and Systems (IROS) 2016, IEEE/RSJ International Conference on Intelligent Robots and Systems, October 2016 (inproceedings)

Abstract
Binary descriptors allow fast detection and matching algorithms in computer vision problems. Though binary descriptors can be computed almost two orders of magnitude faster than traditional gradient-based descriptors, they suffer from poor matching accuracy in challenging conditions. In this paper we propose three improvements for binary descriptors in their computation and matching that enhance their performance in comparison to traditional binary and non-binary descriptors without compromising their speed. This is achieved by learning some weights and threshold parameters that allow customized matching under some variations such as lighting and viewpoint. Our suggested improvements can be easily applied to any binary descriptor. We demonstrate our approach on the ORB (Oriented FAST and Rotated BRIEF) descriptor and compare its performance with the traditional ORB and SIFT descriptors on a wide variety of datasets. In all instances, our enhancements outperform standard ORB and are comparable to SIFT.

am

[BibTex]

Experiments with Hierarchical Reinforcement Learning of Multiple Grasping Policies

Osa, T., Peters, J., Neumann, G.

International Symposium on Experimental Robotics (ISER), 1, pages: 160-172, Springer Proceedings in Advanced Robotics, (Editors: Dana Kulic, Yoshihiko Nakamura, Oussama Khatib and Gentiane Venture), Springer, October 2016 (conference)

ei

DOI [BibTex]

Stable Reinforcement Learning with Autoencoders for Tactile and Visual Data

van Hoof, H., Chen, N., Karl, M., van der Smagt, P., Peters, J.

Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), pages: 3928-3934, IEEE, October 2016 (conference)

ei

DOI Project Page [BibTex]

A New Trajectory Generation Framework in Robotic Table Tennis

Koc, O., Maeda, G., Peters, J.

Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), pages: 3750-3756, October 2016 (conference)

am ei

link (url) DOI [BibTex]

Superpixel Convolutional Networks using Bilateral Inceptions

Gadde, R., Jampani, V., Kiefel, M., Kappler, D., Gehler, P.

In European Conference on Computer Vision (ECCV), Lecture Notes in Computer Science, Springer, 14th European Conference on Computer Vision, October 2016 (inproceedings)

Abstract
In this paper we propose a CNN architecture for semantic image segmentation. We introduce a new “bilateral inception” module that can be inserted in existing CNN architectures and performs bilateral filtering, at multiple feature-scales, between superpixels in an image. The feature spaces for bilateral filtering and other parameters of the module are learned end-to-end using standard backpropagation techniques. The bilateral inception module addresses two issues that arise with general CNN segmentation architectures. First, this module propagates information between (super) pixels while respecting image edges, thus using the structured information of the problem for improved results. Second, the layer recovers a full resolution segmentation result from the lower resolution solution of a CNN. In the experiments, we modify several existing CNN architectures by inserting our inception modules between the last CNN (1 × 1 convolution) layers. Empirical results on three different datasets show reliable improvements not only in comparison to the baseline networks, but also in comparison to several dense-pixel prediction techniques such as CRFs, while being competitive in time.

am ps

pdf supplementary poster Project Page Project Page [BibTex]

Probabilistic Decomposition of Sequential Force Interaction Tasks into Movement Primitives

Manschitz, S., Gienger, M., Kober, J., Peters, J.

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages: 3920-3927, IEEE, October 2016 (conference)

ei

DOI Project Page [BibTex]

Barrista - Caffe Well-Served

Lassner, C., Kappler, D., Kiefel, M., Gehler, P.

In ACM Multimedia Open Source Software Competition, ACM OSSC16, October 2016 (inproceedings)

Abstract
The caffe framework is one of the leading deep learning toolboxes in the machine learning and computer vision community. While it offers efficiency and configurability, it falls short of a full interface to Python. With increasingly involved procedures for training deep networks and reaching depths of hundreds of layers, creating configuration files and keeping them consistent becomes an error-prone process. We introduce the barrista framework, offering full, pythonic control over caffe. It separates responsibilities and offers code to solve frequently occurring tasks for pre-processing, training and model inspection. It is compatible with all caffe versions since mid 2015 and can import and export .prototxt files. Examples are included, e.g., a deep residual network implemented in only 172 lines (for arbitrary depths), compared to 2320 lines in the official implementation for the equivalent model.

am ps

pdf link (url) DOI Project Page [BibTex]

Multi-task logistic regression in brain-computer interfaces

Fiebig, K., Jayaram, V., Peters, J., Grosse-Wentrup, M.

6th Workshop on Brain-Machine Interface Systems at IEEE International Conference on Systems, Man, and Cybernetics (SMC 2016), pages: 002307-002312, IEEE, October 2016 (conference)

ei

link (url) DOI [BibTex]

Active Tactile Object Exploration with Gaussian Processes

Yi, Z., Calandra, R., Veiga, F., van Hoof, H., Hermans, T., Zhang, Y., Peters, J.

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages: 4925-4930, IEEE, October 2016 (conference)

ei

DOI Project Page [BibTex]

On Version Space Compression

Ben-David, S., Urner, R.

Algorithmic Learning Theory - 27th International Conference (ALT), 9925, pages: 50-64, Lecture Notes in Computer Science, (Editors: Ortner, R., Simon, H. U., and Zilles, S.), September 2016 (conference)

ei

DOI Project Page [BibTex]

Learning Probabilistic Features from EMG Data for Predicting Knee Abnormalities

Kohlschuetter, J., Peters, J., Rueckert, E.

XIV Mediterranean Conference on Medical and Biological Engineering and Computing (MEDICON), pages: 668-672, (Editors: Kyriacou, E., Christofides, S., and Pattichis, C. S.), September 2016 (conference)

ei

DOI [BibTex]

Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

Grau-Moya, J., Leibfried, F., Genewein, T., Braun, D. A.

Machine Learning and Knowledge Discovery in Databases, pages: 475-491, Lecture Notes in Computer Science; 9852, Springer, Cham, Switzerland, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML PKDD), September 2016 (conference)

Abstract
Information-theoretic principles for learning and acting have been proposed to solve particular classes of Markov Decision Problems. Mathematically, such approaches are governed by a variational free energy principle and allow solving MDP planning problems with information-processing constraints expressed in terms of a Kullback-Leibler divergence with respect to a reference distribution. Here we consider a generalization of such MDP planners by taking model uncertainty into account. As model uncertainty can also be formalized as an information-processing constraint, we can derive a unified solution from a single generalized variational principle. We provide a generalized value iteration scheme together with a convergence proof. As limit cases, this generalized scheme includes standard value iteration with a known model, Bayesian MDP planning, and robust planning. We demonstrate the benefits of this approach in a grid world simulation.

ei

DOI [BibTex]

Depth Estimation Through a Generative Model of Light Field Synthesis

Sajjadi, M. S. M., Köhler, R., Schölkopf, B., Hirsch, M.

Pattern Recognition - 38th German Conference (GCPR), 9796, pages: 426-438, Lecture Notes in Computer Science, (Editors: Rosenhahn, B. and Andres, B.), Springer International Publishing, September 2016 (conference)

ei

Arxiv Project link (url) DOI [BibTex]

Bidirektionale Interaktion zwischen Mensch und Roboter beim Bewegungslernen (BIMROB)

Kollegger, G., Ewerton, M., Peters, J., Wiemeyer, J.

11. Symposium der DVS Sportinformatik, September 2016 (conference)

ei

link (url) [BibTex]

A Low-cost Sensor Glove with Vibrotactile Feedback and Multiple Finger Joint and Hand Motion Sensing for Human-Robot Interaction

Weber, P., Rueckert, E., Calandra, R., Peters, J., Beckerle, P.

25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), pages: 99-104, August 2016 (conference)

ei

DOI [BibTex]

Experimental and causal view on information integration in autonomous agents

Geiger, P., Hofmann, K., Schölkopf, B.

Proceedings of the 6th International Workshop on Combinations of Intelligent Methods and Applications (CIMA), pages: 21-28, (Editors: Hatzilygeroudis, I. and Palade, V.), August 2016 (conference)

ei

link (url) [BibTex]

Manifold Gaussian Processes for Regression

Calandra, R., Peters, J., Rasmussen, C. E., Deisenroth, M. P.

International Joint Conference on Neural Networks (IJCNN), pages: 3338-3345, IEEE, July 2016 (conference)

ei

DOI [BibTex]

Robust Gaussian Filtering using a Pseudo Measurement

Wüthrich, M., Garcia Cifuentes, C., Trimpe, S., Meier, F., Bohg, J., Issac, J., Schaal, S.

In Proceedings of the American Control Conference (ACC), Boston, MA, USA, July 2016 (inproceedings)

Abstract
Most widely-used state estimation algorithms, such as the Extended Kalman Filter and the Unscented Kalman Filter, belong to the family of Gaussian Filters (GF). Unfortunately, GFs fail if the measurement process is modelled by a fat-tailed distribution. This is a severe limitation, because thin-tailed measurement models, such as the analytically-convenient and therefore widely-used Gaussian distribution, are sensitive to outliers. In this paper, we show that mapping the measurements into a specific feature space enables any existing GF algorithm to work with fat-tailed measurement models. We find a feature function which is optimal under certain conditions. Simulation results show that the proposed method allows for robust filtering in both linear and nonlinear systems with measurements contaminated by fat-tailed noise.

am ics

Web link (url) DOI Project Page [BibTex]

The Mondrian Kernel

Balog, M., Lakshminarayanan, B., Ghahramani, Z., Roy, D. M., Teh, Y. W.

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence (UAI), (Editors: Ihler, Alexander T. and Janzing, Dominik), June 2016 (conference)

ei

Arxiv link (url) Project Page [BibTex]

Recovery of non-linear cause-effect relationships from linearly mixed neuroimaging data

Weichwald, S., Gretton, A., Schölkopf, B., Grosse-Wentrup, M.

Proceedings of the 6th International Workshop on Pattern Recognition in NeuroImaging (PRNI 2016), June 2016 (conference)

ei

PDF Arxiv Code DOI Project Page [BibTex]

Domain Adaptation with Conditional Transferable Components

Gong, M., Zhang, K., Liu, T., Tao, D., Glymour, C., Schölkopf, B.

Proceedings of the 33rd International Conference on Machine Learning (ICML), 48, pages: 2839-2848, JMLR Workshop and Conference Proceedings, (Editors: Balcan, M.-F. and Weinberger, K. Q.), June 2016 (conference)

ei

link (url) [BibTex]

Learning Causal Interaction Network of Multivariate Hawkes Processes

Etesami, S., Kiyavash, N., Zhang, K., Singhal, K.

Proceedings of the 32nd Conference on Uncertainty in Artificial Intelligence (UAI), June 2016, poster presentation (conference)

ei

[BibTex]

Efficient Large-scale Approximate Nearest Neighbor Search on the GPU

Wieschollek, P., Wang, O., Sorkine-Hornung, A., Lensch, H. P. A.

29th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages: 2027-2035, IEEE, June 2016 (conference)

ei

DOI [BibTex]

On the Identifiability and Estimation of Functional Causal Models in the Presence of Outcome-Dependent Selection

Zhang, K., Zhang, J., Huang, B., Schölkopf, B., Glymour, C.

Proceedings of the 32nd Conference on Uncertainty in Artificial Intelligence (UAI), pages: 825-834, (Editors: Ihler, A. and Janzing, D.), AUAI Press, June 2016 (conference)

ei

link (url) [BibTex]

Active Uncertainty Calibration in Bayesian ODE Solvers

Kersting, H., Hennig, P.

Proceedings of the 32nd Conference on Uncertainty in Artificial Intelligence (UAI), pages: 309-318, (Editors: Ihler, A. and Janzing, D.), AUAI Press, June 2016 (conference)

Abstract
There is resurging interest, in statistics and machine learning, in solvers for ordinary differential equations (ODEs) that return probability measures instead of point estimates. Recently, Conrad et al. introduced a sampling-based class of methods that are 'well-calibrated' in a specific sense. But the computational cost of these methods is significantly above that of classic methods. On the other hand, Schober et al. pointed out a precise connection between classic Runge-Kutta ODE solvers and Gaussian filters, which gives only a rough probabilistic calibration, but at negligible cost overhead. By formulating the solution of ODEs as approximate inference in linear Gaussian SDEs, we investigate a range of probabilistic ODE solvers that bridge the trade-off between computational cost and probabilistic calibration, and identify the inaccurate gradient measurement as the crucial source of uncertainty. We propose the novel filtering-based method Bayesian Quadrature filtering (BQF), which uses Bayesian quadrature to actively learn the imprecision in the gradient measurement by collecting multiple gradient evaluations.

ei pn

link (url) Project Page Project Page [BibTex]

The Arrow of Time in Multivariate Time Series

Bauer, S., Schölkopf, B., Peters, J.

Proceedings of the 33rd International Conference on Machine Learning (ICML), 48, pages: 2043-2051, JMLR Workshop and Conference Proceedings, (Editors: Balcan, M. F. and Weinberger, K. Q.), JMLR, June 2016 (conference)

ei

link (url) [BibTex]

A Kernel Test for Three-Variable Interactions with Random Processes

Rubenstein, P. K., Chwialkowski, K. P., Gretton, A.

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence (UAI), (Editors: Ihler, Alexander T. and Janzing, Dominik), June 2016 (conference)

ei

PDF Supplement Arxiv [BibTex]

Continuous Deep Q-Learning with Model-based Acceleration

Gu, S., Lillicrap, T., Sutskever, I., Levine, S.

Proceedings of the 33rd International Conference on Machine Learning (ICML), 48, pages: 2829-2838, JMLR Workshop and Conference Proceedings, (Editors: Maria-Florina Balcan and Kilian Q. Weinberger), JMLR.org, June 2016 (conference)

ei

link (url) Project Page [BibTex]

Bounded Rational Decision-Making in Feedforward Neural Networks

Leibfried, F., Braun, D.

Proceedings of the 32nd Conference on Uncertainty in Artificial Intelligence (UAI), pages: 407-416, June 2016 (conference)

Abstract
Bounded rational decision-makers transform sensory input into motor output under limited computational resources. Mathematically, such decision-makers can be modeled as information-theoretic channels with limited transmission rate. Here, we apply this formalism for the first time to multilayer feedforward neural networks. We derive synaptic weight update rules for two scenarios, where either each neuron is considered as a bounded rational decision-maker or the network as a whole. In the update rules, bounded rationality translates into information-theoretically motivated types of regularization in weight space. In experiments on the MNIST benchmark classification task for handwritten digits, we show that such information-theoretic regularization successfully prevents overfitting across different architectures and attains results that are competitive with other recent techniques like dropout, dropconnect and Bayes by backprop, for both ordinary and convolutional neural networks.

ei

[BibTex]
