PROJECT TITLE :

Online Adaptable Learning Rates for the Game Connect-4

ABSTRACT:

Learning board games by self-play encompasses a long tradition in computational intelligence for games. Based mostly on Tesauro's seminal success with TD-Gammon in 1994, several successful agents use temporal distinction learning nowadays. However so as to achieve success with temporal difference learning on game tasks, often a careful selection of options and a massive number of coaching games is important. Even for board games of moderate complexity like Connect-4, we found in previous work that a very made initial feature set and several numerous game plays are needed. In this work we tend to investigate completely different approaches of online-adaptable learning rates like Incremental Delta Bar Delta (IDBD) or temporal coherence learning (TCL) whether or not they need the potential to speed up learning for such a advanced task. We propose a new variant of TCL with geometric step size changes. We tend to compare those algorithms with several different state-of-the-art learning rate adaptation algorithms and perform a case study on the sensitivity with respect to their meta parameters. We show that during this set of learning algorithms those with geometric step size changes outperform those different algorithms with constant step size changes. Algorithms with nonlinear output functions are slightly better than linear ones. Algorithms with geometric step size changes learn faster by a factor of four as compared to previously printed results on the task Connect-4.


Did you like this research project?

To get this research project Guidelines, Training and Code... Click Here


PROJECT TITLE : LibRoad: Rapid, Online, and Accurate Detection of TPLs on Android ABSTRACT: The detection of third-party libraries, also known as TPLs, is an extremely important part of Android malware analysis. The signature-based
PROJECT TITLE : Imitation Learning Enabled Task Scheduling for Online Vehicular Edge Computing ABSTRACT: The term "vehicular edge computing" (VEC) refers to a potentially useful paradigm that is based on the Internet of vehicles
PROJECT TITLE : Online Spatio-temporal Crowd Flow Distribution Prediction for Complex Metro System ABSTRACT: Crowd flow prediction (CFP), which is an essential part of contemporary traffic management, contributes to the success
PROJECT TITLE : ESA-Stream: Efficient Self-Adaptive Online Data Stream Clustering ABSTRACT: A wide variety of big data applications generate an enormous amount of streaming data that is high-dimensional, real-time, and constantly
PROJECT TITLE : Privacy-Preserving Diverse Keyword Search and Online Pre-Diagnosis in Cloud Computing ABSTRACT: With the development of the Mobile Healthcare Monitoring Network (MHMN), patients' data collected by body sensors

Ready to Complete Your Academic MTech Project Work In Affordable Price ?

Project Enquiry