Antoine Mazières


An open data platform for the French police (2022)

Talking one of the main police forces in France (Gendarmerie) into setting up an open data portal, then bundling a dozen data sources into an actual platform. Unfortunately, it seems the platform hasn't been maintained since it was released...

Website (in French) | Notes (in English) | Code


Gender bias in popular movies (2021)

This research explores computational approaches to assess gender imbalance in mainstream feature films over three decades.
It shows that the ratio of women appearing on-screen has sharply increased from 25% in 1985 to 45% in 2020 with significant disparity from one movie genre to another. The study yielded several negative results, such as no significant correlation between the ratio of women in a movie and its rating by the female audience. Likewise, analysis regarding staging (mise-en-scène) and screen placement (mise-en-cadre) revealed no specific bias.

Blog post | Paper | Dataset | Press coverage by Le Monde, Europe 1, RTS, Wolfram


The quantification of discrimination (2020-2021)

Workshop gathering researchers, civil servants and entrepreneurs involved in the quantification of various types of discrimination (AIDS patients, lawyers, students) using methods ranging from testing to artificial intelligence.

Website (mostly in French)


Filter bubbles on YouTube (2020)

An analysis of “topological confinement” of recommendations on YouTube. That is, piling up complexity until not-so-relevant data ends up fitting the common narrative that recommendation algorithms are nothing but evil. Keep on scrollin'!

Website (with code & data) | Paper


Neurons spike back! A history of AI research (2018)

Presenting artificial intelligence research spanning almost a century as a scientific controversy between connectionist (neural networks, think ChatGPT) and symbolic (rules and logic, think spreadsheet) approaches. Greatly expands on how the current connectionist trend is more of a second coming, and on the epistemological flame war that brought it down in the first place. Based on citation data analysis and several interviews with researchers who sailed through this academic row.

Website | Paper (also available in French) | Mention by Le Monde, Yann LeCun, MIT CSAIL


Origin discrimination in France (2018)

Using artificial intelligence to guess surname origins of several socio-professional populations in France (PMs, lawyers, Ivy League's intake), this research formalises well-documented patterns of discrimination in a country where statistics based on ethnicity are scarce if not outright forbidden. However, the robustness of the model is questionable. Therefore, method and results in their current state are unsuitable for professional auditing, policy making or discrimination studies. Yet, dabbling with onomastics offers a fascinating window onto our past.

Website (with code and data) | Paper (in English) | Blog post (in French) | Press coverage by Le Monde


Google Borders (2016)

A playful interface to explore the broad range of suggestions made by Google based on your location and language. Since Google tends to suggest what other people are searching for, it may provide by proxy a glance at what people are looking up depending on where they are or the main language they speak.
Doesn't work as well as it used to, but using Firefox without any add-ons should allow you to sneak out a few insights.

Website | Papers: 2016, 2013 | Source Code | Press coverage by New Scientist


A Cartography of Machine Learning and its Algorithms (2016)

An attempt to study “styles of reasoning” specific to several machine learning algorithms through their history and their uses. Illustrated with various quantitative analyses of large corpora such as Web of Science, Kaggle and StackExchange.

Manuscript (in French)


Sexualitics: data porn, porn data (2014)

“When Big Data meets Porn” (The Atlantic) or “Google Trends for niche sexual interests” (Wired). More formally, a statistical analysis of the metadata of millions of porn videos, exhibiting trends and clusters of categories.

Paper | Data | Code | Press coverage by Time, The Economist, Fast Company, Le Tag Parfait, L'Express, Street Press


Fabelier (2009-2014)

Co-founder and head of Fabelier, a hackerspace in Paris. Over 100 workshops and a community of 400 people rallied around the Web, Electronics, Neuroscience, Hacktivism and much more.

Website (screenshot)


A Socio-Political Analysis of Free and Open-Source Software Communities (2009)

A social movement perspective on Free and Open-Source Software and hacker cultures. Various analogies are drawn between programming and regulation, architecture or art. A typology of the transgressive and collaborative logics that may be at stake in this context is drawn up.

Manuscript in Portuguese or French