Header logo is


2011


no image
Optimal Reinforcement Learning for Gaussian Systems

Hennig, P.

In Advances in Neural Information Processing Systems 24, pages: 325-333, (Editors: J Shawe-Taylor and RS Zemel and P Bartlett and F Pereira and KQ Weinberger), Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS), 2011 (inproceedings)

Abstract
The exploration-exploitation trade-off is among the central challenges of reinforcement learning. The optimal Bayesian solution is intractable in general. This paper studies to what extent analytic statements about optimal learning are possible if all beliefs are Gaussian processes. A first order approximation of learning of both loss and dynamics, for nonlinear, time-varying systems in continuous time and space, subject to a relatively weak restriction on the dynamics, is described by an infinite-dimensional partial differential equation. An approximate finitedimensional projection gives an impression for how this result may be helpful.

ei pn

PDF Web [BibTex]

2011


PDF Web [BibTex]


no image
Design and analysis of a magnetically actuated and compliant capsule endoscopic robot

Yim, S., Sitti, M.

In Robotics and Automation (ICRA), 2011 IEEE International Conference on, pages: 4810-4815, 2011 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Micro-scale propulsion using multiple flexible artificial flagella

Singleton, J., Diller, E., Andersen, T., Regnier, S., Sitti, M.

In Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on, pages: 1687-1692, 2011 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Control of multiple heterogeneous magnetic micro-robots on non-specialized surfaces

Diller, E., Floyd, S., Pawashe, C., Sitti, M.

In Robotics and Automation (ICRA), 2011 IEEE International Conference on, pages: 115-120, 2011 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Tip based robotic precision micro/nanomanipulation systems

Onal, C., Sumer, B., Ozcan, O., Nain, A., Sitti, M.

In SPIE Defense, Security, and Sensing, pages: 80580M-80580M, 2011 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Design of a miniature integrated multi-modal jumping and gliding robot

Woodward, M. A., Sitti, M.

In Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on, pages: 556-561, 2011 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Free flight simulations and pitch and roll control experiments of a sub-gram flapping-flight micro aerial vehicle

Hines, L. L., Arabagi, V., Sitti, M.

In Robotics and Automation (ICRA), 2011 IEEE International Conference on, pages: 1-7, 2011 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Chemotactic behavior and dynamics of bacteria propelled microbeads

Kim, Dongwook, Liu, Albert, Stitti, Metin

In Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on, pages: 1674-1679, 2011 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Under-actuated tank-like climbing robot with various transitioning capabilities

Seo, T., Sitti, M.

In Robotics and Automation (ICRA), 2011 IEEE International Conference on, pages: 777-782, 2011 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Rotating magnetic micro-robots for versatile non-contact fluidic manipulation of micro-objects

Diller, E., Ye, Z., Sitti, M.

In Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on, pages: 1291-1296, 2011 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Assembly and disassembly of magnetic mobile micro-robots towards deterministic 2-D reconfigurable micro-systems

Pawashe, C., Diller, E., Floyd, S., Sitti, M.

In Robotics and Automation (ICRA), 2011 IEEE International Conference on, pages: 261-266, 2011 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Stochastic dynamics of bacteria propelled spherical micro-robots

Arabagi, V., Behkam, B., Sitti, M.

In Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on, pages: 3937-3942, 2011 (inproceedings)

pi

[BibTex]

[BibTex]

2010


no image
Using an Infinite Von Mises-Fisher Mixture Model to Cluster Treatment Beam Directions in External Radiation Therapy

Bangert, M., Hennig, P., Oelfke, U.

In pages: 746-751 , (Editors: Draghici, S. , T.M. Khoshgoftaar, V. Palade, W. Pedrycz, M.A. Wani, X. Zhu), IEEE, Piscataway, NJ, USA, Ninth International Conference on Machine Learning and Applications (ICMLA), December 2010 (inproceedings)

Abstract
We present a method for fully automated selection of treatment beam ensembles for external radiation therapy. We reformulate the beam angle selection problem as a clustering problem of locally ideal beam orientations distributed on the unit sphere. For this purpose we construct an infinite mixture of von Mises-Fisher distributions, which is suited in general for density estimation from data on the D-dimensional sphere. Using a nonparametric Dirichlet process prior, our model infers probability distributions over both the number of clusters and their parameter values. We describe an efficient Markov chain Monte Carlo inference algorithm for posterior inference from experimental data in this model. The performance of the suggested beam angle selection framework is illustrated for one intra-cranial, pancreas, and prostate case each. The infinite von Mises-Fisher mixture model (iMFMM) creates between 18 and 32 clusters, depending on the patient anatomy. This suggests to use the iMFMM directly for beam ensemble selection in robotic radio surgery, or to generate low-dimensional input for both subsequent optimization of trajectories for arc therapy and beam ensemble selection for conventional radiation therapy.

ei pn

Web DOI [BibTex]

2010


Web DOI [BibTex]


no image
Coherent Inference on Optimal Play in Game Trees

Hennig, P., Stern, D., Graepel, T.

In JMLR Workshop and Conference Proceedings Volume 9: AISTATS 2010, pages: 326-333, (Editors: Teh, Y.W. , M. Titterington ), JMLR, Cambridge, MA, USA, Thirteenth International Conference on Artificial Intelligence and Statistics, May 2010 (inproceedings)

Abstract
Round-based games are an instance of discrete planning problems. Some of the best contemporary game tree search algorithms use random roll-outs as data. Relying on a good policy, they learn on-policy values by propagating information upwards in the tree, but not between sibling nodes. Here, we present a generative model and a corresponding approximate message passing scheme for inference on the optimal, off-policy value of nodes in smooth AND/OR trees, given random roll-outs. The crucial insight is that the distribution of values in game trees is not completely arbitrary. We define a generative model of the on-policy values using a latent score for each state, representing the value under the random roll-out policy. Inference on the values under the optimal policy separates into an inductive, pre-data step and a deductive, post-data part. Both can be solved approximately with Expectation Propagation, allowing off-policy value inference for any node in the (exponentially big) tree in linear time.

ei pn

PDF Web [BibTex]

PDF Web [BibTex]


no image
Adhesion recovery and passive peeling in a wall climbing robot using adhesives

Kute, C., Murphy, M. P., Mengüç, Y., Sitti, M.

In Robotics and Automation (ICRA), 2010 IEEE International Conference on, pages: 2797-2802, 2010 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Comparison of linear and nonlinear buck converter models with varying compensator gain values for design optimization

Sattler, Michael, Lui, Yusi, Edrington, Chris S

In North American Power Symposium (NAPS), 2010, pages: 1-7, 2010 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Enhancing the performance of Bio-inspired adhesives

Chung, H., Glass, P., Sitti, M., Washburn, N. R.

In ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 240, 2010 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Control performance simulation in the design of a flapping wing micro-aerial vehicle

Hines, L. L., Arabagi, V., Sitti, M.

In Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on, pages: 1090-1095, 2010 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Surface tension driven water strider robot using circular footpads

Ozcan, O., Wang, H., Taylor, J. D., Sitti, M.

In Robotics and Automation (ICRA), 2010 IEEE International Conference on, pages: 3799-3804, 2010 (inproceedings)

pi

[BibTex]

[BibTex]

1999


no image
Tele-touch feedback of surfaces at the micro/nano scale: Modeling and experiments

Sitti, M., Horighuchi, S., Hashimoto, H.

In Intelligent Robots and Systems, 1999. IROS’99. Proceedings. 1999 IEEE/RSJ International Conference on, 2, pages: 882-888, 1999 (inproceedings)

pi

[BibTex]

1999


[BibTex]


no image
Challenge to micro/nanomanipulation using atomic force microscope

Hashimoto, H., Sitti, M.

In Micromechatronics and Human Science, 1999. MHS’99. Proceedings of 1999 International Symposium on, pages: 35-42, 1999 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Visualization interface for AFM-based nano-manipulation

Horiguchi, S., Sitti, M., Hashimoto, H.

In Industrial Electronics, 1999. ISIE’99. Proceedings of the IEEE International Symposium on, 1, pages: 310-315, 1999 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Tele-nanorobotics 2-d manipulation of micro/nanoparticles using afm

Sitti, M., Horiguchi, S., Hashimoto, H.

In Advanced Intelligent Mechatronics, 1999. Proceedings. 1999 IEEE/ASME International Conference on, pages: 786-786, 1999 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Two-dimensional fine particle positioning using a piezoresistive cantilever as a micro/nano-manipulator

Sitti, M., Hashimoto, H.

In Robotics and Automation, 1999. Proceedings. 1999 IEEE International Conference on, 4, pages: 2729-2735, 1999 (inproceedings)

pi

[BibTex]

[BibTex]