Benjamin Van Roy
Associate Professor
Management Science and Engineering
Electrical Engineering
Office: Terman 315
Fax: 650.723.1614
Email: bvr @ stanford.edu
Jump to Van Roys's Reseach
Affiliations
Publications
- Dynamic Optimization
- Machine Learning
- Message Passing Algorithms
- Economics and Finance
- Price and Revenue Optimization
- Reputation Systems
- Complete List
Dynamic Optimization
Linear Programming Approaches
- J. Han and B. Van Roy, "Control of Diffusions via Linear Programming," to appear in a volume on stochastic programming in honor of George Dantzig, edited by Gerd Infanger.
- V. F. Farias and B. Van Roy, "An Approximate Dynamic Programming Approach to Network Revenue Management," submitted for publication.
- D. P. de Farias and B. Van Roy, "A Cost-Shaping Linear Program for Average-Cost Approximate Dynamic Programming with Performance Guarantees," Mathematics of Operations Research, Vol. 31, No. 3, pp. 597-620, 2006.
- R. Cogill, M. Rotkowitz, B. Van Roy, S. Lall, "An Approximate Dynamic Programming Approach to Decentralized Control of Stochastic Systems," Lecture Notes in Control and Information Sciences, Springer, Berlin, 2006, Vol. 329, pp. 243-256.
- V. F. Farias and B. Van Roy, "Tetris: A Study of Randomized Constraint Sampling," in Probabilistic and Randomized Methods for Design Under Uncertainty, G. Calafiore and F. Dabbene, eds., Springer-Verlag, 2006.
- D. P. de Farias and B. Van Roy, " On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming," Mathematics of Operations Research, Vol. 29, No. 3, August 2004, pp. 462-478.
- D. P. de Farias and B. Van Roy, "The Linear Programming Approach to Approximate Dynamic Programming," Operations Research, Vol. 51, No. 6, November-December 2003, pp. 850-865.
Approximate Value Iteration and Temporal-Difference Methods
- B. Van Roy, "On Regression-Based Stopping Times," forthcoming in Discrete Event Dynamic Systems.
- C. C. Moallemi, S. Kumar, and B. Van Roy, "Approximate and Data-Driven Dynamic Programming for Queueing Networks," submitted for publication.
-
B. Van Roy
"Performance
Loss Bounds for Approximate Value Iteration with State Aggregation,"
Mathematics of Operations Research, Vol. 31, No. 2, pp. 234-244,
2006.
- D. S. Choi and B. Van Roy, "A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning," Discrete Event Dynamic Systems, Vol. 16, No. 2, April 2006.
- B. Van Roy, " Neuro-Dynamic Programming: Overview and Recent Trends," in Handbook of Markov Decision Processes: Methods and Applications, edited by E. Feinberg and A. Shwartz, Kluwer, 2001.
- J. N. Tsitsiklis and B. Van Roy, " On Average Versus Discounted Reward Temporal-Difference Learning," Machine Learning, Vol. 49, No. 2-3, 2002, pp. 179-191.
- J. N. Tsitsiklis and B. Van Roy, "Regression Methods for Pricing Complex American-Style Options," IEEE Transactions on Neural Networks, Vol. 12, No. 4 (special issue on computational finance), July 2001, pp. 694-703.
- D. P. de Farias and B. Van Roy, " On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning," Journal of Optimization Theory and Applications, Vol. 105, No. 3, June, 2000.
- J. N. Tsitsiklis and B. Van Roy, "Average Cost Temporal-Difference Learning," Automatica, Vol. 35, No. 11, November 1999, pp. 1799-1808.
- J. N. Tsitsiklis and B. Van Roy, "Optimal Stopping of Markov Processes: Hilbert Space Theory, Approximation Algorithms, and an Application to Pricing High-Dimensional Financial Derivatives," IEEE Transactions on Automatic Control, Vol. 44, No. 10, October 1999, pp. 1840-1851.
- J. N. Tsitsiklis and B. Van Roy, "An Analysis of Temporal-Difference Learning with Function Approximation," IEEE Transactions on Automatic Control, Vol. 42, No. 5, May 1997, pp. 674-690.
- J. N. Tsitsiklis and B. Van Roy, "Feature-Based Methods for Large Scale Dynamic Programming," Machine Learning, Vol. 22, 1996, pp. 59-94.
- B. Van Roy, D. P. Bertsekas, Y. Lee, and J. N. Tsitsiklis, "A Neuro-Dynamic Programming Approach to Retailer Inventory Management," Proceedings of the IEEE Conference on Decision and Control, 1997. (full length version)
Miscellaneous
- H. Permuter, P. Cuff, B. Van Roy, and T. Weissman, "Capacity of the Trapdoor Channel with Feedback," IEEE Transactions on Information Theory, Vol. 54, No. 7, pp. 3150-3165, 2008.
- V. F. Farias and B. Van Roy, "Dynamic Pricing with a Prior on Market Response," submitted for publication.
- B. Van Roy, "A Short Proof of Optimality for the MIN Cache Replacement Algorithm," Information Processing Letters, Vol. 102, No. 2, pp. 72-73, 2007.
- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Computational Methods for Oblivious Equilibrium," forthcoming in Operations Research.
-
G. Y. Weintraub, L. C. Benkard, and B. Van Roy,
"Markov
Perfect Industry Dynamics with
Many Firms," Econometrica, Vol. 76, No. 6, 2008, pp. 1375-1411.
[Technical Appendix]
- V. F. Farias and B. Van Roy "Approximation Algorithms for Dynamic Resource Allocation," Operations Research Letters, Vol. 34, No. 2, March 2006, pp. 180-190.
- X. Yan, P. Diaconis, P. Rusmevichientong, and B. Van Roy, "Solitaire: Man Versus Machine," Advances in Neural Information Processing Systems 17, MIT Press, 2005.
- C. C. Moallemi and B. Van Roy "Distributed Optimization in Adaptive Networks," Advances in Neural Information Processing Systems 16, MIT Press, 2004. [appendix]
- P. Rusmevichientong and B. Van Roy, "A Tractable POMDP for a Class of Sequencing Problems," Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2001.
- N. O. Keohane, B. Van Roy, and R. J. Zeckhauser, "Managing the Quality of a Resource with Stock and Flow Controls," Journal of Public Economics, Vol. 91, 2007, pp. 541-569.
Machine Learning
- X. Yan and B. Van Roy, "Manipulation Robustness of Collaborative Filtering Systems," submitted for publication.
- V. F. Farias, C. C. Moallemi, B. Van Roy, and T. Weissman, "Universal Reinforcement Learning," forthcoming in the IEEE Transactions on Information Theory.
- B. Van Roy "Performance Loss Bounds for Approximate Value Iteration with State Aggregation," Mathematics of Operations Research, Vol. 31, No. 2, pp. 234-244, 2006.
- C. C. Moallemi and B. Van Roy "Distributed Optimization in Adaptive Networks," Advances in Neural Information Processing Systems 16, MIT Press, 2004. [appendix]
- D. S. Choi and B. Van Roy, "A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning," Discrete Event Dynamic Systems, Vol. 16, No. 2, April 2006.
- J. N. Tsitsiklis and B. Van Roy, " On Average Versus Discounted Reward Temporal-Difference Learning," Machine Learning, Vol. 49, No. 2-3, 2002, pp. 179-191.
- D. P. de Farias and B. Van Roy, " On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning," Journal of Optimization Theory and Applications, Vol. 105, No. 3, June, 2000.
- J. N. Tsitsiklis and B. Van Roy, "Average Cost Temporal-Difference Learning," Automatica, Vol. 35, No. 11, November 1999, pp. 1799-1808.
- J. N. Tsitsiklis and B. Van Roy, "An Analysis of Temporal-Difference Learning with Function Approximation," IEEE Transactions on Automatic Control, Vol. 42, No. 5, May 1997, pp. 674-690.
Message Passing Algorithms
- C. C. Moallemi and B. Van Roy, "Convergence of the Min-Sum Algorithm for Convex Optimization," submitted for publication.
- C. C. Moallemi and B. Van Roy, "A Message-Passing Paradigm for Resource Allocation," submitted for publication.
- C. C. Moallemi and B. Van Roy, "Convergence of Min-Sum Message Passing for Quadratic Optimization," IEEE Transactions on Information Theory, Vol. 55, No. 5, pp. 2413-2423, 2009.
- C. C. Moallemi and B. Van Roy, "Consensus Propagation," IEEE Transactions on Information Theory, Vol. 52, No. 11, pp. 4753-4766, 2006.
- P. Rusmevichientong and B. Van Roy, " An Analysis of Belief Propagation on the Turbo Decoding Graph with Gaussian Densities," IEEE Transactions on Information Theory, Vol. 47, No. 2, pp. 745-765, 2001.
Economics and Finance
- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Industry Dynamics: Foundations for Models with an Infinite Number of Firms," submitted for publication.
- C. C. Moallemi, B. Park, and B. Van Roy, "Strategic Execution in the Presence of an Uninformed Arbitrageur," submitted for publication.
- B. Van Roy, "On Regression-Based Stopping Times," forthcoming in Discrete Event Dynamic Systems.
- R. Johari, G. Y. Weintraub, and B. Van Roy, "Investment and Market Structure in Industries with Congestion," submitted for publication.
- J. Han and B. Van Roy, "Control of Diffusions via Linear Programming," to appear in a volume on stochastic programming in honor of George Dantzig, edited by Gerd Infanger.
- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Computational Methods for Oblivious Equilibrium," forthcoming in Operations Research. [Matlab code (updated July 2008)]
- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Markov Perfect Industry Dynamics with Many Firms," Econometrica, Vol. 76, No. 6, 2008, pp. 1375-1411. [Technical Appendix]
- N. O. Keohane, B. Van Roy, and R. J. Zeckhauser, "Managing the Quality of a Resource with Stock and Flow Controls," Journal of Public Economics, Vol. 91, 2007, pp. 541-569.
- J. N. Tsitsiklis and B. Van Roy, "Regression Methods for Pricing Complex American-Style Options," IEEE Transactions on Neural Networks, Vol. 12, No. 4 (special issue on computational finance), July 2001, pp. 694-703.
- J. N. Tsitsiklis and B. Van Roy,
"Optimal Stopping of
Markov Processes: Hilbert Space Theory,
Approximation Algorithms, and an
Application to Pricing High-Dimensional
Financial Derivatives,"
IEEE Transactions on Automatic Control,
Vol. 44, No. 10, October 1999, pp. 1840-1851.
Price and Revenue Optimization
- V. F. Farias and B. Van Roy, "An Approximate Dynamic Programming Approach to Network Revenue Management," submitted for publication.
- V. F. Farias and B. Van Roy, "Dynamic Pricing with a Prior on Market Response," forthcoming in Operations Research.
- P. Rusmevichientong, J. A. Salisbury, L. T. Truss, B. Van Roy, and P. W. Glynn, "Opportunities and Challenges in Using Online Preference Data for Vehicle Pricing: A Case Study at General Motors," Journal of Revenue and Pricing Management, Vol. 5, No. 1, pp. 45-61, 2006.
- P. Rusmevichientong, B. Van Roy, and P. W. Glynn, "A Non-Parametric Approach to Multi-Product Pricing," Operations Research, Vol. 54, No. 1, 2006, pp. 82-98.
Complete Publications List
- X. Yan and B. Van Roy, "Manipulation Robustness of Collaborative Filtering Systems," submitted for publication.
- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Industry Dynamics: Foundations for Models with an Infinite Number of Firms," submitted for publication.
- X. Yan and B. Van Roy, "Reputation Markets," Proceedings of the ACM SIGCOMM 2008 Workshop on Economics of Networks, Systems, and Computation.
- C. C. Moallemi, B. Park, and B. Van Roy, "Strategic Execution in the Presence of an Uninformed Arbitrageur," submitted for publication.
- B. Van Roy, "On Regression-Based Stopping Times," forthcoming in Discrete Event Dynamic Systems.
- C. C. Moallemi and B. Van Roy, "Convergence of the Min-Sum Algorithm for Convex Optimization," submitted for publication.
- J. Han and B. Van Roy, "Control of Diffusions via Linear Programming," to appear in a volume on stochastic programming in honor of George Dantzig, edited by Gerd Infanger.
- V. F. Farias and B. Van Roy, "An Approximate Dynamic Programming Approach to Network Revenue Management," submitted for publication.
- C. C. Moallemi and B. Van Roy, "A Message-Passing Paradigm for Resource Allocation," submitted for publication.
- C. C. Moallemi, S. Kumar, and B. Van Roy, "Approximate and Data-Driven Dynamic Programming for Queueing Networks," submitted for publication.
- H. Permuter, P. Cuff, B. Van Roy, and T. Weissman, "Capacity of the Trapdoor Channel with Feedback," IEEE Transactions on Information Theory, Vol. 54, No. 7, pp. 3150-3165, 2008.
- V. F. Farias and B. Van Roy, "Dynamic Pricing with a Prior on Market Response," forthcoming in Operations Research.
- B. Van Roy, "A Short Proof of Optimality for the MIN Cache Replacement Algorithm," Information Processing Letters, Vol. 102, No. 2, pp. 72-73, 2007.
- C. C. Moallemi and B. Van Roy, "Convergence of Min-Sum Message Passing for Quadratic Optimization," IEEE Transactions on Information Theory, Vol. 55, No. 5, pp. 2413-2423, 2009.
- R. Johari, G. Y. Weintraub, and B. Van Roy, "Investment and Market Structure in Industries with Congestion," submitted for publication.
- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Computational Methods for Oblivious Equilibrium," forthcoming in Operations Research. [Matlab code (updated July 2008)]
-
G. Y. Weintraub, C. L. Benkard, and B. Van Roy,
"Markov
Perfect Industry Dynamics with
Many Firms," Econometrica, Vol. 76, No. 6, 2008, pp. 1375-1411.
[Technical Appendix]
Preliminary version:- G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Oblivious Equilibrium: A Mean Field Approximation for Large Scale Dynamic Games," Advances in Neural Information Processing Systems 18, MIT Press, 2006.
-
C. C. Moallemi and B. Van Roy,
"Consensus
Propagation," IEEE Transactions on Information Theory,
Vol. 52, No. 11, pp. 4753-4766, 2006.
Preliminary version:- C. C. Moallemi and B. Van Roy, "Consensus Propagation," Advances in Neural Information Processing Systems 18, MIT Press, 2006.
-
V. F. Farias, C. C. Moallemi, B. Van Roy, and T. Weissman,
"Universal
Reinforcement Learning," forthcoming in the IEEE Transactions on Information Theory.
Preliminary version:- V. F. Farias, C. C. Moallemi, B. Van Roy, and T. Weissman, "A Universal Scheme for Learning," Proceedings of the IEEE International Symposium on Information Theory, Adelaide, Australia, September 2005.
- H. Zhang, A. Goel, R. Govindan, K. Mason, and B. Van Roy, "Improving Eigenvector-Based Reputation Systems Against Collusion," Workshop on Algorithms and Models for the Web Graph, October 2004.
- P. Rusmevichientong, J. A. Salisbury, L. T. Truss, B. Van Roy, and P. W. Glynn, "Opportunities and Challenges in Using Online Preference Data for Vehicle Pricing: A Case Study at General Motors," Journal of Revenue and Pricing Management, Vol. 5, No. 1, pp. 45-61, 2006.
-
D. P. de Farias and B. Van Roy,
"A
Cost-Shaping Linear Program for Average-Cost Approximate Dynamic
Programming with Performance Guarantees,"
Mathematics of Operations Research, Vol. 31, No. 3, pp. 597-620,
2006.
Preliminary versions:- D. P. de Farias and B. Van Roy, "A Linear Program for Bellman Error Minimization with Performance Guarantees," Advances in Neural Information Processing Systems 17, MIT Press, 2005.
- D. P. de Farias and B. Van Roy, "Approximate Linear Programming for Average-Cost Dynamic Programming," Advances in Neural Information Processing Systems 15, MIT Press, 2003.
- V. F. Farias and B. Van Roy "Approximation Algorithms for Dynamic Resource Allocation," Operations Research Letters, Vol. 34, No. 2, March 2006, pp. 180-190.
- X. Yan, P. Diaconis, P. Rusmevichientong, and B. Van Roy, "Solitaire: Man Versus Machine," Advances in Neural Information Processing Systems 17, MIT Press, 2005.
-
R. Cogill, M. Rotkowitz, B. Van Roy, S. Lall,
"An
Approximate Dynamic Programming Approach to Decentralized Control
of Stochastic Systems,"
Lecture Notes in Control and Information Sciences,
Springer, Berlin, 2006, Vol. 329, pp. 243-256.
Preliminary version:- R. Cogill, M. Rotkowitz, B. Van Roy, S. Lall, "An Approximate Dynamic Programming Approach to Decentralized Control of Stochastic Systems," Proceedings of the Allerton Conference on Communication, Control, and Computing, 2004, pp. 1040-1049.
-
B. Van Roy
"Performance
Loss Bounds for Approximate Value Iteration with State Aggregation,"
Mathematics of Operations Research, Vol. 31, No. 2, pp. 234-244, 2006.
Preliminary version:- B. Van Roy, "TD(0) Leads to Better Policies than Approximate Value Iteration," Advances in Neural Information Processing Systems 18, MIT Press, 2006.
- V. F. Farias and B. Van Roy, "Tetris: A Study of Randomized Constraint Sampling," in Probabilistic and Randomized Methods for Design Under Uncertainty, G. Calafiore and F. Dabbene, eds., Springer-Verlag, 2006.
- W. B. Powell and B. Van Roy, "Approximate Dynamic Programming for High-Dimensional Dynamic Resource Allocation Problems," in Handbook of Learning and Approximate Dynamic Programming, edited by J. Si, A. G. Barto, W. B. Powell, and D. Wunsch, Wiley-IEEE Press, Hoboken, NJ, 2004, pp. 261-279.
-
C. C. Moallemi and B. Van Roy
"Distributed
Optimization in Adaptive Networks," Advances in Neural Information
Processing Systems 16, MIT Press, 2004.
[appendix]
Preliminary version:- C. C. Moallemi and B. Van Roy, "Decentralized Protocols for Optimization of Sensor Networks," Proceedings of Allerton 2003.
- P. Rusmevichientong, B. Van Roy, and P. W. Glynn, "A Non-Parametric Approach to Multi-Product Pricing," Operations Research, Vol. 54, No. 1, 2006, pp. 82-98.
- B. Van Roy, "Book Review: Self-Learning Control of Finite Markov Chains, by A. S. Poznyak, K. Najim, and E. Gomez-Ramirez," Automatica , Volume 39, Issue 2, February 2003, pp. 373-376.
-
D. S. Choi and B. Van Roy,
"A
Generalized Kalman Filter for Fixed Point Approximation
and Efficient Temporal-Difference Learning,"
Discrete Event Dynamic Systems, Vol. 16, No. 2, April 2006.
Preliminary version:- D. S. Choi and B. Van Roy, "A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning," Proceedings of the International Conference on Machine Learning, 2001.
- N. Agarwal, J. Basch, P. Beckmann, P. Bharti, S. Bloebaum, S. Casadei, A. Chou, P. Enge, W. Fong, N. Hathi, W. Mann, A. Sahai, J. Stone, J. Tsitsiklis, and B. Van Roy, "Algorithms for GPS Operation Indoors and Downtown," GPS Solutions, Vol. 6, No. 3, December, 2002, pp. 149-160.
- P. Rusmevichientong and B. Van Roy, "A Tractable POMDP for a Class of Sequencing Problems," Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2001.
- D. P. de Farias and B. Van Roy, " On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming," Mathematics of Operations Research, Vol. 29, No. 3, August 2004, pp. 462-478.
-
D. P. de Farias and B. Van Roy,
"The
Linear Programming Approach to Approximate Dynamic Programming,"
Operations Research, Vol. 51, No. 6, November-December 2003,
pp. 850-865.
Preliminary version:- D.P. de Farias and B. Van Roy, "Approximate Dynamic Programming via Linear Programming," Advances in Neural Information Processing Systems 14, MIT Press, 2002.
- P. Rusmevichientong and B. Van Roy, " Decentralized Decision-Making in a Large Team with Local Information," Games and Economic Behavior , Vol. 43, No. 2, 2003, pp. 266-295.
-
N. O. Keohane, B. Van Roy, and R. J. Zeckhauser,
"Managing
the Quality of a Resource with Stock and Flow Controls,"
Journal of Public Economics, Vol. 91, 2007, pp. 541-569.
Preliminary version:- N. O. Keohane, B. Van Roy, and R. J. Zeckhauser, "The Optimal Harvesting of Environmental Bads," Proceedings of the IEEE Conference on Decision and Control, 2000.
- B. Van Roy, " Neuro-Dynamic Programming: Overview and Recent Trends," in Handbook of Markov Decision Processes: Methods and Applications, edited by E. Feinberg and A. Shwartz, Kluwer, 2001.
- J. N. Tsitsiklis and B. Van Roy, " On Average Versus Discounted Reward Temporal-Difference Learning," Machine Learning, Vol. 49, No. 2-3, 2002, pp. 179-191.
- J. N. Tsitsiklis and B. Van Roy, "Regression Methods for Pricing Complex American-Style Options," IEEE Transactions on Neural Networks, Vol. 12, No. 4 (special issue on computational finance), July 2001, pp. 694-703.
-
P. Rusmevichientong and B. Van Roy,
"
An Analysis of Belief Propagation on the Turbo Decoding
Graph with Gaussian Densities,"
IEEE Transactions on Information Theory, Vol. 47,
No. 2, pp. 745-765, 2001.
Preliminary version:- P. Rusmevichientong and B. Van Roy, "An Analysis of Turbo Decoding with Gaussian Priors," Advances in Neural Information Processing Systems 12, MIT Press, 2000.
-
D. P. de Farias and B. Van Roy,
"
On the Existence of Fixed Points for Approximate Value
Iteration and Temporal-Difference Learning,"
Journal of Optimization Theory and Applications,
Vol. 105, No. 3, June, 2000.
Preliminary versions:- D. P. de Farias and B. Van Roy, "Approximate Value Iteration with Randomized Policies," Proceedings of the IEEE Conference on Decision and Control, 2000.
- D. P. de Farias and B. Van Roy, "Approximate Value Iteration and Temporal-Difference Learning," Proceedings of the IEEE Symposium 2000 on Adaptive Systems for Signal Processing, Communications and Control, 2000.
- D. P. de Farias and B. Van Roy, "Fixed Points for Approximate Value Iteration and Temporal-Difference Learning," Proceedings of the International Conference on Machine Learning, 2000.
- J. N. Tsitsiklis and B. Van Roy,
"Average Cost
Temporal-Difference Learning," Automatica,
Vol. 35, No. 11, November 1999, pp. 1799-1808.
Preliminary version:- J. N. Tsitsiklis and B. Van Roy, "Average Cost Temporal-Difference Learning," Proceedings of the IEEE Conference on Decision and Control, 1997.
- B. Van Roy, "Temporal-Difference Learning and Applications in Finance," Computational Finance (Proceedings of the Sixth International Conference on Computational Finance, Leonard N. Stern School of Business, January 6-8, 1999). Edited by Y. S. Abu-Mostafa, B. LeBaron, A. W. Lo, and A. S. Weigend. Cambridge, MA: MIT Press, 1999.
- J. N. Tsitsiklis and B. Van Roy,
"Optimal Stopping of
Markov Processes: Hilbert Space Theory,
Approximation Algorithms, and an
Application to Pricing High-Dimensional
Financial Derivatives,"
IEEE Transactions on Automatic Control,
Vol. 44, No. 10, October 1999, pp. 1840-1851.
Preliminary versions:- J. N. Tsitsiklis and B. Van Roy, "Overview of Neuro-Dynamic Programming and a Case Study in Optimal Stopping," Proceedings of the IEEE Conference on Decision and Control, 1997.
- J. N. Tsitsiklis and B. Van Roy, "Approximate Solutions to Optimal Stopping Problems," Advances in Neural Information Processing Systems 9, MIT Press, 1997.
- J. N. Tsitsiklis and B. Van Roy,
"An Analysis of
Temporal-Difference Learning with Function Approximation,"
IEEE Transactions on Automatic Control,
Vol. 42, No. 5, May 1997, pp. 674-690.
Preliminary version:- J. N. Tsitsiklis and B. Van Roy, "Analysis of Temporal-Difference Learning with Function Approximation," Advances in Neural Information Processing Systems 9, MIT Press, 1997.
- J. N. Tsitsiklis and B. Van Roy, "Feature-Based Methods for Large Scale Dynamic Programming," Machine Learning, Vol. 22, 1996, pp. 59-94.
- B. Van Roy, D. P. Bertsekas, Y. Lee, and J. N. Tsitsiklis, "A Neuro-Dynamic Programming Approach to Retailer Inventory Management," Proceedings of the IEEE Conference on Decision and Control, 1997. (full length version)
- B. Van Roy and J. N. Tsitsiklis, " Stable Linear Approximations to Dynamic Programming for Stochastic Control Problems with Local Transitions," Advances in Neural Information Processing Systems 8, MIT Press, 1996.
-
R. Kennedy, Y. Lee, B. Van Roy, C. Reed, and R. Lippman,
Solving Data Mining Problems Through Pattern Recognition,
Prentice-Hall, 1997.
Preliminary version:- R. Kennedy, Y. Lee, C. Reed, and B. Van Roy, Solving Pattern Recognition Problems, Unica, 1995.
Theses
- B. Van Roy, "Learning and Value Function Approximation in Complex Decision Processes," PhD Thesis, Massachusetts Institute of Technology, May 1998.
- B. Van Roy, "Feature-Based Methods for Large Scale Dynamic Programming," Master's Thesis, Massachusetts Institute of Technology, January 1995.
Research Group
Current Students
Yi-Hao (Edward) Kao
Michael Padilla
Beomsoo Park
Waraporn Tongprasit
Zheng Wen
Xiang (Robbie) Yan
Former Doctoral Students
Ciamac Moallemi (EE PhD, 2007)
Assistant Professor, Graduate School of Business, Columbia University
Dissertation: A Message-Passing Paradigm for Optimization
Vivek Farias (EE PhD, 2007)
Assistant Professor, Sloan School of Management, MIT
Dissertation: Revenue Management Beyond "Estimate, Then Optimize"1
Gabriel Weintraub
(MS&E PhD, 2006)
Assistant Professor, Graduate School of Business, Columbia University
Dissertation: Industry Dynamics, Investment, and Market Structure
Jiarui (Jared) Han (Statistics PhD, 2005)
Two Sigma Investments
Dissertation: Dynamic Portfolio Management - An Approximate Linear
Programming Approach
Kahn Mason (MS&E PhD, 2005)
DE Shaw & Co.
Dissertation: Detecting Colluders in PageRank - Finding Slow Mixing States
in a Markov Chain
David S. Choi (EE PhD, 2003)
Postdoctoral Fellow, Harvard University
Dissertation: Optimization for Value Function Approximation
Paat Rusmevichientong (MS&E
PhD, 2003)
Assistant Professor, Operations Research and Industrial Engineering, Cornell University
Dissertation: A Non-Parametric Approach to Multi-Product Pricing2
Daniela P. de Farias (MS&E PhD, 2002)
Assistant Professor, Mechanical Engineering, MIT
Dissertation: The Linear Programming Approach to Approximate Dynamic
Programming3
Former MS Students
Nick Choo (MS&E MS, 2006)
GMO Investment Management
Former BS Honors Thesis Students
Dave Mulford (CS BS, 2002)
Thesis: Server Allocation in Web Server Farms
Xiang (Robbie) Yan (CS BS, 2005)
Thesis: Transaction-Cost-Conscious Pairs Trading Via Approximate Dynamic
Programming
2 Awarded 2003 INFORMS Dantzig Dissertation Award
3 Awarded 2002 INFORMS Dantzig Dissertation Award
