Bikramjit Banerjee's Publications

All Classified by Publication Type

Journal

Trung Nguyen and Bikramjit Banerjee. Reinforcement Learning as a Rehearsal for Swarm Foraging. Swarm Intelligence, 16(1):29–58, Springer, 2022.
Details BibTeX Download: [pdf]
Saurabh Arora, Prashant Doshi, and Bikramjit Banerjee. I2RL: online inverse reinforcement learning under occlusion. Autonomous Agents and Multi-Agent Systems, (2021) 35(4), Springer, 2020.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Sneha Racharla. Human-Agent Transfer from Observations. The Knowledge Engineering Review, 36(2021, e2), Cambridge University Press, 2020.
Details BibTeX Download: [pdf]
Roi Ceren, Keyang He, Prashant Doshi, and Bikramjit Banerjee. PALO Bounds for Reinforcement Learning in Partially Observable Stochastic Games. Neurocomputing, 420(2021):36–56, Elsevier, 2020.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Syamala Vittanala, and Matthew E. Taylor. Team Learning from Human Demonstration with Coordination Confidence. The Knowledge Engineering Review, 34(e12), Cambridge University Press, 2019.
Details BibTeX Download: [pdf]
Tsz-Chiu Au, Bikramjit Banerjee, Prithviraj Dasgupta, and Peter Stone. Multirobot Systems. IEEE Intelligent Systems (Guest Editorial), 32(6):3–5, IEEE Press, 2017.
Details BibTeX Download: [pdf]
B. Banerjee and C. Davis. Multi-agent Path Finding with Persistence Conflicts. IEEE Transactions on Computational Intelligence and AI in Games, 9(4):402–409, IEEE, 2017.
Details BibTeX Download: [pdf]
D. S. Brown, J. Hudack, N. Gemelli, and B. Banerjee. Exact and Heuristic Algorithms for Risk-Aware Stochastic Physical Search. Computational Intelligence, 33(3):524–553, Wiley, 2017.
Details BibTeX Download: [pdf]
Landon Kraemer and Bikramjit Banerjee. Multi-agent reinforcement learning as a rehearsal for decentralized planning. Neurocomputing, 190:82–94, Elsevier, 2016.
Details BibTeX Download: [pdf]
B. Banerjee, J. Lyle, and L. Kraemer. The complexity of multi-agent plan recognition. Autonomous Agents and Multi-Agent Systems, 29(1):40–72, Springer, 2015.
Details BibTeX Download: [pdf]
B. Banerjee and L. Kraemer. Stackelberg Surveillance. Informatica, 39(4), 2015.
Details BibTeX Download: [pdf]
L. Kraemer and B. Banerjee. Reinforcement Learning of Informed Initial Policies for Decentralized Planning. ACM Transactions on Autonomous and Adaptive Systems (TAAS), 9(4):18:1–18:32, ACM Press, 2014.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Jing Peng. Strategic best-response learning in multi-agent systems. Journal of Experimental and Theoretical Artificial Intelligence, 24(2):139–160, Taylor and Francis, 2012.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Landon Kraemer. Action Discovery for Single and Multi-agent Reinforcement Learning. Advances in Complex Systems, 14(2):279–305, World Scientific Publishing, 2011.
Details BibTeX Download: [pdf]
Kyle Walsh and Bikramjit Banerjee. Fast A* with iterative resolution for navigation. International Journal on Artificial Intelligence Tools, 19(1):101–119, World Scientific Publishing, 2010.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Ahmed Abukmail, and Landon Kraemer. Layered Intelligence for agent-based crowd simulation. SIMULATION: Transactions of the Society for Modeling and Simulation International, 85(10):621–633, SAGE, 2009.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Jing Peng. Generalized Multiagent Learning with Performance Bound. Autonomous Agents and Multi-Agent Systems, 15(3):281–312, Springer, 2007.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Jing Peng. Reactivity and Safe Learning in Multiagent Systems. Adaptive Behavior, 14(4):339–356, SAGE, 2006.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Sandip Sen, and Jing Peng. On-Policy Concurrent Reinforcement Learning. Journal of Experimental and Theoretical Artificial Intelligence, 16(4):245–260, 2004.
Details BibTeX Download: [pdf]
D. Warner, J.N. Richter, S.D. Durbin, and B. Banerjee. Mining a FAQ Using RightNow Web. Database and Network Journal, 32(2):3–8, 2002.
Details BibTeX Download: (unavailable)
Bikramjit Banerjee, Anish Biswas, Manisha Mundhe, Sandip Debnath, and Sandip Sen. Using Bayesian Networks to Model Agent Relationships. Journal of Applied Artificial Intelligence: Special issue on "Deception, Fraud and Trust in Agent Societies", 14(9):867–880, 2000.
Details BibTeX Download: [ps]

Chapter in Edited Volume

P. Dasgupta, K. Cheng, and B. Banerjee. Adaptive Multi-robot Team Reconfiguration Using a Policy-Reuse Reinforcement Learning Approach. In Advanced Agent Technology, pp. 330–345, Springer, 2012.
Details BibTeX Download: (unavailable)
B. Banerjee and L. Kraemer. Evaluation and comparison of multi-agent based crowd simulation systems. In F. Dignum, editor, Agents for Games and Simulations II: Trends in Techniques, Concepts and Design, pp. 53–66, Springer, 2011.
Details BibTeX Download: [pdf]
B. Banerjee and J. Peng. Unifying Convergence and No-regret in Multiagent Learning. In Karl Tuyls, Pieter Jan 't Hoen, Sandip Sen, and Katja Verbeeck, editors, Learning and Adaption in Multi-Agent Systems, pp. 100–114, Springer-Verlag, 2006.
Details BibTeX Download: [pdf]
B. Banerjee and S. Sen. Selecting Partners. In S. Parsons, P. Gmytrasiewicz, and M. Wooldridge, editors, Game Theory and Decision Theory in Agent-based Systems, pp. 29–42, Kluwer, 2002.
Details BibTeX Download: [pdf]
R. Mukherjee, B. Banerjee, and S. Sen. Learning Mutual Trust. In R. Falcone, M. Singh, and Y. Tan, editors, Trust in Cyber-Societies: Integrating Human and Artificial Perspectives, pp. 145–158, Springer-Verlag, 2001.
Details BibTeX Download: (unavailable)

Refereed Conference

Saurabh Arora, Prashant Doshi, and Bikramjit Banerjee. Online Inverse Reinforcement Learning with Learned Observation Model. In Proceedings of the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand, 2022. To appear.
Details BibTeX Download: [pdf]
Keyang He, Prashant Doshi, and Bikramjit Banerjee. Reinforcement Learning in Many-Agent Settings Under Partial Observability. In Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), pp. 780–789, Eindhoven, Netherlands, 2022.
Details BibTeX Download: [pdf]
Saurabh Arora, Prashant Doshi, and Bikramjit Banerjee. Min-Max Entropy Inverse RL of Multiple Tasks. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA 2021), pp. 12639–12645, Xi'an, China, 2021.
Details BibTeX Download: [pdf]
Keyang He, Bikramjit Banerjee, and Prashant Doshi. Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards. In Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021), pp. 602–610, 2021.
Details BibTeX Download: [pdf]
Saurabh Arora, Prashant Doshi, and Bikramjit Banerjee. Online Inverse Reinforcement Learning under Occlusion. In Proceedings of the 18th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS-19), pp. 1170–1178, Montreal, Canada, 2019.
Details BibTeX Download: [pdf]
Vinamra Jain, Prashant Doshi, and Bikramjit Banerjee. Model-Free IRL using Maximum Likelihood Estimation. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI-19), pp. 3951–3958, Honolulu, HI, 2019.
Details BibTeX Download: [pdf]
B. Banerjee. Autonomous Acquisition of Behavior Trees for Robot Control. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-18), pp. 3460–3467, Madrid, Spain, 2018.
Details BibTeX Download: [pdf]
B. Banerjee, S. Loscalzo, and D.L. Thompson. Detection of Plan Deviation in Multi-Agent Systems. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI-16), pp. 2445–2451, Phoenix, AZ, 2016.
Details BibTeX Download: [pdf]
R. Ceren, P. Doshi, and B. Banerjee. Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds. In Proceedings of the 15th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-16), pp. 530–538, Singapore, 2016.
Details BibTeX Download: [pdf]
T. Neller, L. Brown, R. West, J. Heliotis, S. Strout, I. Bezakova, B. Banerjee, and D. Thompson. Model AI Assignments 2014. In Proceedings of Educational Advances in Artificial Intelligence (EAAI-14), pp. 3054–3056, Singapore, 2014.
Details BibTeX Download: (unavailable)
Bikramjit Banerjee. Pruning for Monte Carlo Distributed Reinforcement Learning in Decentralized POMDPs. In Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence (AAAI-13), pp. 88–94, Bellevue, WA, July 2013.
Details BibTeX Download: [pdf]
A. Kebert, B. Banerjee, G. George, J. Solano, and W. Solano. Detecting Distributed SQL injection attacks in a Eucalyptus Cloud environment. In Proceedings of the 12th International Conference on Security and Management (SAM-13), CSREA Press, Las Vegas, NV, July 2013.
Details BibTeX Download: [pdf]
Landon Kraemer and Bikramjit Banerjee. Concurrent Reinforcement Learning as a Rehearsal for Decentralized Planning Under Uncertainty (Extended Abstract). In Proceedings of the 12th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS-13), pp. 1291–1292, St. Paul, MN, May 2013.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Jeremy Lyle, Landon Kraemer, and Rajesh Yellamraju. Sample Bounded Distributed Reinforcement Learning for Decentralized POMDPs. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12), pp. 1256–1262, Toronto, Canada, July 2012.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Jeremy Lyle, and Landon Kraemer. Efficient Context Free Parsing of Multi-agent Activities for Team and Plan Recognition (Extended Abstract). In Proceedings of the 11th International Conference on Autonomous Agents and Multi-agent Systems (AAMAS-12), pp. 1441–1442, Valencia, Spain, June 2012.
Details BibTeX Download: [pdf]
Landon Kraemer and Bikramjit Banerjee. Informed Initial Policies for Learning in Dec-POMDPs. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (Student Abstract, AAAI-12), pp. 2433–2434, Toronto, Canada, July 2012.
Details BibTeX Download: [pdf]
B. Banerjee and L. Kraemer. Branch and Price for Multi-Agent Plan Recognition. In Proceedings of the 25th AAAI Conference on Artificial Intelligence (AAAI-11), pp. 601–607, San Francisco, CA, 2011.
Details BibTeX Download: [pdf]
B. Banerjee, L. Kraemer, and W. Solano. Particle Filtering for Diagnosis and Prognosis of Anomalies in Rocket Engine Tests. In Proceedings of the American Institute of Aeronautics and Astronautics (AIAA) Conference on Infotech@Aerospace, St. Louis, MO, 2011.
Details BibTeX Download: [pdf]
B. Banerjee and L. Kraemer. Coalition Structure Generation in Multi-Agent Systems with Mixed Externalities. In Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10), pp. 175–182, Toronto, Canada, 2010.
Details BibTeX Download: [pdf]
B. Banerjee, L. Kraemer, and J. Lyle. Multi-Agent Plan Recognition: Formalization and Algorithms. In Proceedings of AAAI-10, pp. 1059–1064, Atlanta, GA, 2010.
Details BibTeX Download: [pdf]
B. Banerjee and L. Kraemer. Validation of Agent Based Crowd Egress Simulation (Extended Abstract). In Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10), pp. 1551–1552, Toronto, Canada, 2010.
Details BibTeX Download: [pdf]
B. Banerjee and L. Kraemer. Action Discovery for Reinforcement Learning (Extended Abstract). In Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS-10), pp. 1585–1586, Toronto, Canada, 2010.
Details BibTeX Download: [pdf]
B. Banerjee, A. Abukmail, and L. Kraemer. Advancing the layered approach to agent-based crowd simulation. In Proceedings of the 22nd ACM/IEEE/SCS Workshop on the Principles of Advanced and Distributed Simulation (PADS), pp. 185–192, Rome, Italy, 2008.
NOMINATED FOR BEST PAPER AWARD
Details BibTeX Download: [pdf]
B. Banerjee, M. Bennett, M. Johnson, and A. Ali. Congestion Avoidance in Multi-Agent-based Egress Simulation. In Proceedings of the 2008 International Conference on Artificial Intelligence (ICAI), Las Vegas, USA, 2008.
Details BibTeX Download: [pdf]
Patrick Moghames and Bikramjit Banerjee. Deconstructing a Neural Network. In Proceedings of the International Conference on Computer Games and Allied Technology (CGAT 08), pp. 296–306, Singapore, 2008.
Details BibTeX Download: (unavailable)
K.P. Walsh and B. Banerjee. Variable Resolution A*. In Proceedings of EUROSIS GAMEON-NA Conference, McGill University, Canada, 2008.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Peter Stone. General Game learning using knowledge transfer. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07), pp. 672–677, Hyderabad, India, 2007.
Details BibTeX Download: [pdf]
G. Shahine and B. Banerjee. Player Modeling using Knowledge transfer. In Proceedings of EUROSIS GAMEON-NA Conference, pp. 82–89, Gainesville, FL, 2007.
WINNER OF BEST PAPER AWARD
Details BibTeX Download: [pdf]
B. Banerjee and J. Peng. RV_sigma(t): A unifying approach to performance and convergence in online multiagent learning. In Proceedings of the Fifth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-06), Hakodate, Japan, 2006.
Details BibTeX Download: (unavailable)
B. Banerjee and J. Peng. Efficient No-regret multiagent learning. In Proceedings of the Twentieth National Conference on Artificial Intelligence (AAAI-05), pp. 41–46, AAAI, Pittsburgh, PA, 2005.
Details BibTeX Download: [pdf]
B. Banerjee and J. Peng. Efficient Learning of Multi-step best response. In Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-05), pp. 60–66, ACM Press, The Netherlands, 2005.
Details BibTeX Download: [pdf]
B. Banerjee and J. Peng. On the performance of on-line concurrent reinforcement learners. In Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-05), ACM Press, The Netherlands, 2005. Dissertation abstract at the Doctoral Consortium.
Details BibTeX Download: (unavailable)
B. Banerjee and J. Peng. Performance bounded reinforcement learning in strategic interactions. In Proceedings of the 19th National Conference on Artificial Intelligence (AAAI-04), pp. 2–7, AAAI Press, San Jose, CA, 2004.
Details BibTeX Download: [pdf]
B. Banerjee and J. Peng. The Role of Reactivity in Multiagent Learning. In Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04), pp. 538–545, ACM, New York, USA, 2004.
Details BibTeX Download: [ps]
B. Banerjee and J. Peng. Adaptive policy gradient in multiagent learning. In Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-03), pp. 686–692, Melbourne, Australia, 2003.
Details BibTeX Download: [ps]
B. Banerjee and J. Peng. Convergent Gradient Ascent in General Sum Games. In Proceedings of the 13th European Conference on Machine Learning, pp. 1–9, Helsinki, Finland, 2002.
Details BibTeX Download: [ps]
J. Peng, B. Banerjee, and D. R. Heisterkamp. Kernel index for relevance feedback retrieval in large image databases. In Proceedings of the 1st International Conference on Fuzzy Systems and Knowledge Discovery: Computational Intelligence for the E-Age, pp. 187–191, Singapore, 2002. Also appeared in the Proceedings of the 9th International Conference on Neural Information Processing, 2002.
Details BibTeX Download: (unavailable)
N. Richter, S. Durbin, B. Banerjee, Z. Gedeon, and D. Warner. Fuzzy Adaptive Clustering and Classification for Browsable Document Directories. In Proceedings of the Conference on Artificial Neural Networks in Engineering, St. Louis, Missouri, 2002.
Details BibTeX Download: [pdf]
B. Banerjee, S. Sen, and J. Peng. Fast concurrent reinforcement learners. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01), pp. 825–832, Morgan Kaufmann, Seattle, WA, August 4–10, 2001.
Details BibTeX Download: [ps]
D. Warner, N. Richter, S. Durbin, and B. Banerjee. Mining clickstream data to facilitate user interaction with a customer service knowledgebase in Rightnow Web. In Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD), pp. 467–472, 2001.
Details BibTeX Download: [pdf]
B. Banerjee, S. Debnath, and S. Sen. Combining Multiple Perspectives. In Proceedings of the International Conference on Machine Learning (ICML-00), pp. 33–40, Stanford University, CA, 2000.
Details BibTeX Download: [ps]
B. Banerjee and S. Sen. Selecting Partners. In Proceedings of the Fourth International Conference on Autonomous Agents (AGENTS-00), pp. 261–262, 2000.
Details BibTeX Download: (unavailable)

Refereed Workshop

B. Banerjee and M.E. Taylor. Coordination Confidence Based Human-Multi-Agent Transfer Learning for Collaborative Teams. In Proceedings of the Adaptive Learning Agents (ALA-18) Workshop, Stockholm, Sweden, July 2018.
Details BibTeX Download: [pdf]
D. S. Brown, J. Hudack, and B. Banerjee. Algorithms for stochastic physical search on general graphs. In Proceedings of AAAI Workshop on Planning, Search, and Optimization (PLANSOPT-15), Austin, TX, January 2015.
Details BibTeX Download: (unavailable)
Bikramjit Banerjee and Landon Kraemer. Counterfactual Regret Minimization for Decentralized Planning. In Proceedings of the AAMAS-13 Workshop on Adaptive Learning Agents (ALA-13), pp. 84–91, St. Paul, MN, May 2013. Also appeared in Proceedings of the AAMAS-13 Workshop on Multiagent Sequential Decision Making (MSDM-13), pp. 32–39.
Details BibTeX Download: [pdf]
Landon Kraemer and Bikramjit Banerjee. Rehearsal based multi-agent reinforcement learning of decentralized plans. In Proceedings of the 8th AAMAS-13 Workshop on Multiagent Sequential Decision Making (MSDM-13), pp. 24–31, St. Paul, MN, May 2013.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Jeremy Lyle, Landon Kraemer, and Rajesh Yellamraju. Solving Finite Horizon Decentralized POMDPs by Distributed Reinforcement Learning. In Proceedings of the AAMAS-12 Workshop on Multiagent Sequential Decision Making Under Uncertainty (MSDM-12), pp. 9–16, Valencia, Spain, June 2012.
Details BibTeX Download: [pdf]
Landon Kraemer and Bikramjit Banerjee. Informed Initial Policies for Learning in Dec-POMDPs. In Proceedings of the AAMAS-12 Workshop on Adaptive Learning Agents (ALA-12), pp. 135–143, Valencia, Spain, June 2012.
Details BibTeX Download: [pdf]
Bikramjit Banerjee, Jeremy Lyle, and Landon Kraemer. New Algorithms and Hardness Results for Multi-Agent Plan Recognition. In Proceedings of ICAPS-11 Workshop on Goal, Activity and Plan Recognition (GAPRec-11), pp. 24–31, Freiburg, Germany, 2011.
Details BibTeX Download: [pdf]
K. Cheng, P. Dasgupta, and B. Banerjee. Adaptive Multi-Robot Team Reconfiguration using a Policy-Reuse Reinforcement Learning Approach. In Proceedings of AAMAS-11 Workshop on Autonomous Robots and Multirobot Systems (ARMS-11), Taipei, Taiwan, 2011.
Details BibTeX Download: (unavailable)
Bikramjit Banerjee and Landon Kraemer. Reinforcement Learning with Action Discovery. In Proceedings of the Adaptive and Learning Agents Workshop at AAMAS-10, pp. 30–37, Toronto, Canada, May 10, 2010.
Details BibTeX Download: [pdf]
Bikramjit Banerjee and Landon Kraemer. Search performance of multi-agent plan recognition in a general model. In Proceedings of the Plan Activity and Intent Recognition (PAIR) Workshop at AAAI-10, Atlanta, GA, July 12, 2010.
Details BibTeX Download: [pdf]
B. Banerjee, Greg Kuhlmann, and Peter Stone. Value Function Transfer for General Game Playing. In Online Proceedings of the ICML-06 Workshop on Structural Knowledge Transfer for Machine Learning, CMU, Pittsburgh, 2006. Held in conjunction with ICML-06.
Details BibTeX Download: [pdf]
B. Banerjee and J. Peng. Convergence of no-regret learning in multiagent systems. In Proceedings of the First International Workshop on Learning and Adaptation in Multiagent Systems (LAMAS), Utrecht, The Netherlands, 2005. Held in conjunction with AAMAS-05.
Details BibTeX Download: (unavailable)
Bikramjit Banerjee and Jing Peng. Countering Deception in Multiagent Reinforcement Learning. In Proceedings of the Sixth International Workshop on Trust, Privacy, Deception, and Fraud in Agent Societies, Melbourne, Australia, July 14, 2003.
Details BibTeX Download: [ps]
Bikramjit Banerjee, Rajatish Mukherjee, and Sandip Sen. Learning Mutual Trust. In Autonomous Agents 2000 Workshop Proceedings on "Deception, Fraud and Trust in Agent Societies", 2000.
Details BibTeX Download: [ps]
B. Banerjee, S. Sen, and J. Peng. Evaluating Competitive Learners. In Working notes of AGENTS-00/ECML-00 Workshop on Learning Agents, Barcelona, Spain, 2000.
Details BibTeX Download: (unavailable)
B. Banerjee, S. Debnath, and S. Sen. Using Bayesian Networks to aid Negotiations among Agents. In Working notes of the AAAI-99 workshop on Negotiation: Settling Conflicts and Identifying Opportunities, 1999.
Details BibTeX Download: [ps]

Unspecified

Bikramjit Banerjee and Rajesh Yellamraju. Pruning for Monte Carlo Distributed reinforcement learning in Decentralized POMDPs. Technical Report, The University of Southern Mississippi.
Details BibTeX Download: (unavailable)
Landon Kraemer and Bikramjit Banerjee. Distributed reinforcement learning for policy synchronization in infinite horizon Dec-POMDPs. Technical Report, The University of Southern Mississippi.
Details BibTeX Download: (unavailable)

Generated by bib2html.pl (written by Patrick Riley) on Fri Nov 11, 2022 16:58:28