Associate Professor in Computer Science, University of Warwick.
I am interested in Artificial Intelligence for social good. In my work I use reinforcement learning and game theory to design artificial agents and environments that display desirable social behaviour.
I study situations in which multiple agents take decisions in complex environments and try to exploit the information they have about the system in order to realise their objectives.
I'm particularly interested in agents that can influence one another and how to engineer systems in which this social influence yields desirable outcomes.
I'm also interested in situations in which agents have very limited knowledge of the world around and need to take a good decision anyway.
Here are links to my Google Scholar , DBLP , Research Gate profiles.Below, a few examples of my research.
This project is funded by the Leverhulme Trust (2023-2026).
We often find ourselves in a position to choose between the ethical option - when we contribute to a common good by paying some cost, for example recycling - and the selfish option - when we instead do what is convenient to us free-riding on the work of others, for example not recycling.
We know peer pressure plays a role in our decisions. But can we find out when it plays a positive role?
With Jack Bara and Giulia Andrighetto, we found that some social networks promote cooperation better than others. Especially when we are allowed to choose who to interact with, and we can do so frequently enough.
In the networks above the good (green) behaviour is quickly suppressed in a Barabasi-Albert Graph (left), but it's significantly more resistant in a closed community (right), under identical partner selection and imitation rules.
Our computational framework to study the emergence of pro-social behaviour through imitation when individuals interact in a social networks can be found here.
For a laid-back introduction to the topic check out my guest appearance in the TWS Game Theory Podcast (Podcast Episode)
Imagine to be the owner of a new and still relatively unknown restaurant. The quality of food is not spectacular and the customers you have seen so far are only limited to a tiny number of friends of yours. Your account on Tripadvisor has received no review and your financial prospects look grim at best.
There is one easy solution to your problems: you ask your friends to write an enthusiastic review for you, possibly in exchange for a free meal. After this, Tripadvisor lists your restaurant as excellent and the number of customers, together with your profit, suddenly florishes.
Systems such as Tripadvisor, where a small proportion of customers writes reviews and influences a large number of potential customers, are not "bribery-proof": each restaurant owner is able to offer a compensation - monetary or not - in exchange for positive evaluation, having an impact on the whole set of potential customers.
I proposed, with Umberto Grandi, a "bribery-proof" rating system which you can find here.
For an introduction to the topic check my talk at the Warwick AI Summit (Talk here)
You are playing the following position as Black and you have 10 seconds to move.
What would you do?
There are not many moves you can make, but each of them requires a deep calculation in order to be properly assessed. In time pressure we --- and the same goes for grandmasters and supercomputers --- make judgment calls. It might be that your first thought was to push the pawn to b2. This is the most "natural" move. After all, the pawn gets closer to the queening square b1, and after you queen the pawn you are going to have a decisive material advantage. True, but b2 happens to be the losing move. After b2 White has the surprising reply: Kf6! which forces you to queen the pawn. But then comes e7, and you are checkmated.
Now look at the following:
This position is very similar to the previous one, only White is to move and the king is on h4.
In this position, White is lost. But should he or she resign? My claim is that he or she should not resign, as long as he or she believes that the opponent will not see how to win, for instance if, after White moves the king back to g5, Black will blunder as above.
In general, what is a good way of making decisions in complicated games like Chess (or Go), where we know we cannot assess every position, but we also know that our opponents can't either?
I proposed a model for this type of decision problems which you can find here.
Group meeting at the Boiler Room, Leamington Spa - October 22nd, 2021. From left to right: Stanislav Zhydkov, Charlotte Roman, Grzegorz Lisowki, myself, Jacques Bara and Charlie Pilgrim.
Besides myself, multiagent systems researchers at Warwick include Markus Brill, Debmalya Mandal, Ramanujan Sridharan and Long Than-Tranh.
Keep an eye on PhD positions in our area!
Over the years, I taught various modules in at Warwick and Imperial College and, as a teaching assistant, at Utrecht and Luxembourg Universities.
Here I collect some gadgets which may be useful to colleagues who are teaching similar content.
This is an assignment for the Agent-Based Systems Module, designed with Charlie Pilgrim. Students build their own regret matching algorithm and use it to solve for Nash equilibrium in a game of Rock Paper Scissors.
The assignment is written as a Jupyter notebook that walks the students through building the algorithm in Python, explaining each step in an interactive way. Students write the required elements of the algorithm by completing functions and running tests to verify that the code works correctly. They are then challenged to put these functions together to complete the algorithm and use it solve Rock Paper Scissors.
As an extra challenge, students are asked to visualise the learning process. Each of the last 3 years we have presented similar versions of the notebook to computer science Masters students as a one-hour seminar, with very good feedback. We include model solutions that can be released to students after the seminar.
The work got selected as a model AI assignment at EAAI , the Symposium on Educational Advances in Artificial Intelligence.
Warwick Summer School, with Charlie Pilgrim.
Students will learn the basic methodologies for the design and the analysis of AI in complex systems with many interacting agents, ranging from competitive to cooperative interaction. The course take will be interdisciplinary, touching upon themes that are important for computer science, economics, and philosophy.
By the end of the course the students will learn how to program a strategic agent participating in an auction. During the allocated seminars time students will be receive support on the programming skills required for the task (Python), from scratch.
Here is the link to the course.
Taught at Imperial College London in 2016 with Murray Shanahan and based on Russell and Norvig's "Artificial Intelligence: a Modern Approach" (3rd edition) .
I come from Capoterra. In case you did not know, Capoterra is a town close to Cagliari, the main city of Sardinia. One of the things I'm most proud of is that I speak Sardinian, which is not, as often reported, a dialect of Italian.
George Orwell, "Politics and the English Language"
Richard Dawkins, "The Future looks Bright"