Εργαστήριο Προγραμματισμού και Τεχνολογίας Ευφυών Υπολογιστικών Συστημάτων

Μόνιμο URI για αυτήν την κοινότηταhttps://dspace.library.tuc.gr/handle/123456789/73

Νέα

Περιηγούμαι

Τώρα δείχνει 1 - 6 από 6

A bayesian personalized recommendation system
(Πολυτεχνείο Κρήτης, 2014) Babas Konstantinos; Μπαμπας Κωνσταντινος; Chalkiadakis Georgios; Χαλκιαδακης Γεωργιος; Lagoudakis Michael; Λαγουδακης Μιχαηλ; Deligiannakis Antonios; Δεληγιαννακης Αντωνιος
In this research, we propose a novel Bayesian approach for personalized recommendations. We succeed in providing recommendations that are entirely personalized, based on a user’s past item “consumptions”, building a representative user model which reflects agent’s corresponding beliefs. Having a set of items, our agent has to select the one which better matches her beliefs about a specific user, in order to recommend it and receive the corresponding reward. In our approach, we model both user preferences and items under recommendation as multivariate Gaussian distributions; and make use of Normal-Inverse Wishart priors to model the recommendation agent beliefs about user types. We interpret user ratings in an innovative way, using them to guide a Bayesian updating process that helps us both capture a user’s current mood, and maintain her overall user type. We produced several variants of our approach, and applied them in the movie recommendations domain, evaluating them on data from the MovieLens dataset. We developed a generic & domain independent system, able to face the scalability challenge and able to capture user preferences (long-term and short-term). Moreover, we dealt with the exploration vs exploitation dilemma in this domain, via the application of various exploration algorithms (e.g., VPI exploration). Ours is a completely personalized approach, which exploits Bayesian Reinforcement Learning in order to recommend an item or a top-N group of items, without the need of ratings prediction. We do not employ a Collaborative Filtering or Content-based or Preference Elicitation technique, but we are still able to provide successful recommendations. Furthermore, we tackle the famous “cold-start” problem via the use of Bayesian and VPI explorations. Our algorithms are shown to be competitive against a state-of-the-art method, which nevertheless requires a minimum set of ratings from various users to provide recommendations --- unlike our entirely personalized approach.
Extending Kouretes Statechart Editor for executing statechart-based robotic behavior models
(Πολυτεχνείο Κρήτης, 2014) Papadimitriou Georgios; Παπαδημητριου Γεωργιος; Lagoudakis Michael; Λαγουδακης Μιχαηλ; Spanoudakis Nikolaos; Σπανουδακης Νικολαος; Samoladas Vasilis; Σαμολαδας Βασιλης
The development of high-level behavior for autonomous robots is a time-consuming task even for experts. The Kouretes Statechart Editor (KSE) is a Computer-Aided Software Engineering (CASE) tool, which allows to easily specify a desired robot behavior as a statechart model utilizing a variety of base robot functionalities (vision, localization, locomotion, motion skills, communication) developed within the Monas robotic software architecture framework. This thesis presents an extension to KSE, which allows to define generic agent behaviors using automatic framework-independent code generation, as long as the underlying software framework is written in the C++ programming language. This way a user can program behaviors for physical robots or software agents that can be executed on any platform using any C++ software framework. This thesis demonstrates the transparent use of the extended KSE in the SimSpark 3D soccer simulation, the Wumpus world, and the Starcraft Broodwar strategy game.
Grammatical inference for event recognition
(Πολυτεχνείο Κρήτης, 2014) Kofinas Nikolaos; Κοφινας Νικολαος; Lagoudakis Michael; Λαγουδακης Μιχαηλ; Bletsas Aggelos; Μπλετσας Αγγελος; Garofalakis Minos; Γαροφαλακης Μινως
As robot technology finds applications in the real world (search and rescue, daily household tasks, etc.), huge amounts of data are generated during autonomous robot missions. In such applications, it is often desirable to recognize high-level events that may have occurred during a mission either online or offline. Event Recognition in robot missions currently relies on human expertise and time-consuming data annotation. A modern method to recognize events is to employ Probabilistic Context-Free Grammars (PCFGs), which are formal models that can capture complex patterns in discrete sequences and can be used to parse incoming sensor data streams in order to detect patterns that may signal the occurrence of some event of interest. Recent experimentation with such methods on data from Autonomous Underwater Vehicle (AUV) missions indicated that interesting events can be recognized by parsing sequences of sensor data using an intuitive hand-written PCFG. This thesis introduces a generic procedure which can be used to automatically construct PCFGs which encode sensor data sequences that typically appear during normal robot operation using recorded logs from past missions. The resulting PCFGs can be used to recognize abnormal events in new missions evidenced by sensor data sequences which cannot be interpreted as normal. The proposed procedure consists of two parts: (a) the transformation of sensor streams into discrete sequences either to form a training corpus offline or to generate input for online parsing and (b) a Grammatical Inference algorithm in order to learn a compact PCFG consistent with a given training corpus. The learning part relies on a local search method over the space of possible grammars using chunk and merge operations. The search method aims to find a compact grammar that also maximizes its posterior probability, in a Bayesian sense, with respect to a given training corpus. The proposed procedure is evaluated on a variety of domains ranging from data-sets generated by typical context-free grammars to data-sets generated from real robot missions (NAO robot walk and AUV navigation). The results indicate that our approach is capable of producing reliable PCFG-based event recognizers, which may yield some false positive signals, but in general succeed in capturing abnormalities.
Real-time planning and learning in the "Settlers of Catan" strategy game
(Technical University of Crete, 2014) Panousis Konstantinos; Πανουσης Κωνσταντινος-Παναγιωτης; Chalkiadakis Georgios; Χαλκιαδακης Γεωργιος; Lagoudakis Michael; Λαγουδακης Μιχαηλ; Deligiannakis Antonios; Δεληγιαννακης Αντωνιος
Ο αλγόριθμος Monte Carlo Tree Search (MCTS) είναι μια γενική μέθοδος για την λήψη βέλτιστων αποφάσεων. Η μέθοδος αξιοποιεί τη λήψη (ουσιαστικά τυχαίων) δειγμάτων από τις πιθανές ενέργειες, και δημιουργεί ένα δέντρο αποφάσεων, μέσω του οποίου αναζητείται η βέλτιστη απόφαση. Μετά την επιτυχημένη εφαρμογή της μεθόδου, στο παιχνίδι -δύο παικτών και τέλειας πληροφορίας- Go, και τις προσδοκίες που δημιούργησε, η επαρκής κατανόηση των πλεονεκτημάτων και των αδυναμιών του αλγορίθμου είναι ένα ζητούμενο. Στην εργασία αυτή, εφαρμόζουμε τον αλγόριθμο MCTS, στο επιτραπέζιο παιχνίδι στρατηγικής Άποικοι του Κατάν, ένα παιχνίδι πολλών παικτών,μη-ντετερμινιστικό και μερικώς παρατηρήσιμο. Αναπτύσσουμε και αξιολογούμε τρεις διαφορετικές παραλλαγές στο κομμάτι της δημιουργίας του δέντρου του αλγορίθμου: συγκεκριμένα τη μέθοδο UCT, τη μέθοδο Bayesian UCT και τη μέθοδο Value of Perfect Information (VPI). Οι αλγόριθμοι αυτοί κατ'ουσίαν επιχειρούν να ισορροπήσουν το δίλημμα μεταξύ εξερεύνησης (exploration) και εκμετάλλευσης(exploitation) στο συγκεκριμένο τομέα. Επιπρόσθετα, δημιουργήσαμε διάφορες ευριστικές στρατηγικές για να μπορεί ο πράκτορας μας να ανταπεξέλθει σε συγκεκριμένες καταστάσεις που μπορούν να εμφανιστούν και οι οποίες απορρέουν από τους κανόνες του παιχνιδιού· σε αντίθεση με τους περισσότερους αυτοματοποιημένους παίκτες για τους Αποίκους του Κατάν, η υλοποίηση μας προσφέρει ένα (έστω απλό) σχέδιο διαπραγμάτευσης για να έχει ο πράκτορας μας τη δυνατότητα να ανταλλάσει πόρους με άλλους παίκτες. Αξίζει να σημειωθεί ότι είναι η πρώτη φορά που η μέθοδος Bayesian UCT χρησιμοποιείται στον αλγόριθμο MCTS στο παιχνίδι Άποικοι του Κατάν και είναι επίσης η πρώτη φορά που η μέθοδος VPI χρησιμοποείται σε σύζευξη με τον αλγόριθμο MCTS γενικότερα. Δοκιμάζουμε και αξιολογούμε τους πρακτόρες μας με βάση την αποτελεσματικότητα τους σε μεταξύ τους αναμετρήσεις, αλλά και σε αναμετρήσεις τους ενάντια σε υπαρκτές υλοποιήσεις άλλων αυτόνομων πρακτόρων, συμπεριλαμβανομένης και της ισχυρότερης υπάρχουσας ευρετικής υλοποίησης αυτόνομου πράκτορα. Τα αποτελέσματα μας είναι ενθαρρυντικά, και υποδηλώνουν ότι ο αλγόριθμος MCTS μπορεί να επωφεληθεί από τις παραλλαγές που υλοποιήσαμε. Ειδικά ο πράκτορας που χρησιμοποιεί την μέθοδο VPI, εμφανίζεται να είναι αρκετά ανταγωνιστικός, και η απόδοση του μπορεί να συγκριθεί με την απόδοση άλλων υπαρκτών αυτόνομων παικτών του παιχνιδιού Άποικοι του Κατάν, παρόλο που οι υπολογιστικοί πόροι που αξιοποιεί ήταν ιδιαίτερα περιορισμένοι σε σχέση με αυτούς που αξιοποιούν οι αντίπαλοι του.
Visual color and field line recognition and exploitation for the RoboCup Standard Platform League
(Πολυτεχνείο Κρήτης, 2014) Liverios-Marinos Ioannis; Λιβεριος-Μαρινος Ιωαννης; Lagoudakis Michael; Λαγουδακης Μιχαηλ; Chalkiadakis Georgios; Χαλκιαδακης Γεωργιος; Zervakis Michalis; Ζερβακης Μιχαλης
In order to complete complex tasks, both humans and robots are bound by one of the most important senses, the visual sense, which helps them perceive the state of the environment that surrounds them. Robotic soccer, known as RoboCup, represents a complex, stochastic, real-time, multi-agent, competitive domain for autonomous robots. In such domains, the ability to understand the environment is critical for the accomplishment of the assigned task and is required for a range of activities, such as locomotion, coordination, and decision making. This thesis focuses on two specific aspects of robot visual perception, color and field line recognition, in order to complement prior work on this problem by our RoboCup team Kouretes. In RoboCup, the objects of interest are characterized by unique colors (orange ball, green field, yellow goalposts, white lines) and their recognition relies on the correct identification of image areas corresponding to the same color. However, the problem of color recognition is highly affected by the environment illumination conditions, as well as the robot's camera settings. In this thesis, we propose a new approach to color recognition, which relies on modeling on-line the signatures of the target colors in the color space under different illuminations using density estimation with Gaussian distributions and dynamically identifying and using the correct models by exploiting a metric on the most common color in the RoboCup environment (green). On the problem of field line recognition, we focus on identifying a variety of field line landmarks (straight lines, center circle, corners, T-lines), which are useful for localization. This is accomplished by searching for white pixels in the camera images, selectively keeping those that can be part of a line, and then identifying each type of line using curve fitting techniques. For each recognized line landmark, we use geometry and projection techniques to estimate its distance and bearing with respect to the robot. Our work contributes an on-line tool for color recognition and a real-time module for field line recognition appropriate for on-board execution on the Aldebaran Nao humanoid robots. The proposed methods perform reliably in most cases, failing only in extreme cases, which are typically infrequent during RoboCup games.
Συστηματική αναζήτηση και ενισχυτική μάθηση για το επιτραπέζιο παιχνίδι Backgammon
(Technical University of Crete, 2014) Tsigdinos Stylianos; Τσιγδινος Στυλιανος; Lagoudakis Michael; Λαγουδακης Μιχαηλ; Zervakis Michalis; Ζερβακης Μιχαλης; Petrakis Evripidis; Πετρακης Ευριπιδης
Τα παιχνίδια απασχολούσαν, από τότε που υπάρχει πολιτισμός, τις διανοητικές λειτουργίες του ανθρώπου. Στα πλαίσια της Τεχνητής Νοημοσύνης, η αφηρημένη φύση των παιχνιδιών καθώς και η δυσκολία επίλυσής τους τα καθιστά ένα ενδιαφέρον πεδίο μελέτης. Στην παρούσα διπλωματική εργασία υλοποιούμε ένα πράκτορα για το επιτραπέζιο παιχνίδι Backgammon καθώς και ένα γραφικό περιβάλλον στο οποίο μπορούν να διεξαχθούν παρτίδες του παιχνιδιού αυτού με αντίπαλο τον πράκτορα μας ή κάποιον άνθρωπο-παίκτη. Σκοπός μας είναι η εύρεση μιας καλής στρατηγικής (policy), η οποία θα επιτρέπει στον πράκτορά μας να αυξήσει τις πιθανότητές του, με την κατάλληλη επιλογή κινήσεων, να οδηγηθεί σε μία τερματική κατάσταση νίκης. Η στρατηγική αυτή προσδιορίζει ουσιαστικά την συμπεριφορά του πράκτορα κατά την διάρκεια του παιχνιδιού. Ο μεγάλος παράγοντας διακλάδωσης του δέντρου αναζήτησης για το παιχνίδι αυτό, που πολλές φορές μπορεί να φτάσει μέχρι και κάποιες εκατοντάδες κινήσεις, καθώς και το στοιχείο της τύχης που ενυπάρχει στη φύση του παιχνιδιού, λόγω του ότι χρησιμοποιούνται ζάρια για την υπόδειξη των δυνατών αποστάσεων στις κινήσεις των δύο αντιπάλων, αυξάνει σημαντικά την δυσκολία αναζήτησης και εύρεσης της βέλτιστης αυτής στρατηγικής. Χρησιμοποιώντας ειδικές τεχνικές αναζήτησης, όπως αυτή του αλγόριθμου MiniMax και κάποιες παραλλαγές του όπως αυτή του κλαδέματος Alpha-Beta, πετύχαμε αποδεκτές ταχύτητες αναζήτησης σε ικανοποιητικό βάθος στο δέντρο αναζήτησης του παιχνιδιού. Η συστηματική αναζήτηση σε συνδυασμό με τη χρήση τεχνικών από το πεδίο της ενισχυτικής μάθησης (Reinforcement Learning) για την εκμάθηση μιας κατάλληλης συνάρτησης αξιολόγησης μέσα από δοκιμές σε πολλές παρτίδες, οδήγησαν στην εύρεση μιας στρατηγικής, η οποία επιτρέπει στον πράκτορά μας να ανταγωνιστεί αρκετά καλούς φυσικούς αλλά και τεχνητούς παίκτες στο παιχνίδι Backgammon.

Περιηγούμαι

Πλοήγηση Εργαστήριο Προγραμματισμού και Τεχνολογίας Ευφυών Υπολογιστικών Συστημάτων ανά Συγγραφέα "Lagoudakis Michael"