Researcher · Artificial Intelligence
I am currently completing an MSc in Artificial Intelligence at Bocconi University. I work as a Research Fellow at the AIRC Institute of Molecular Oncology (IFOM) in Ylli Doksani’s lab, where I develop computer vision and AI solutions to support biomedical research. I am also part of the Computational Biology group led by Prof. Francesca Buffa at Bocconi, and I am conducting my master’s thesis on Convolutional Set Transformers—a novel neural architecture we introduced here—under the supervision of Prof. Giacomo Boracchi (Politecnico di Milano). My research interests focus on deep learning architectures and methods, with particular emphasis on applications in computer vision and bioinformatics.
We introduce the Convolutional Set Transformer (CST), a novel neural architecture designed to process image sets of arbitrary cardinality that are visually heterogeneous yet share high-level semantics (such as a common category, scene, or concept).
This package, available on PyPI, provides the reference implementation of the Convolutional Set Transformer. It includes reusable Keras 3 layers for building CST architectures and offers a simple interface to load and use the CST-15 model pre-trained on ImageNet.
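A minimal usage sketch is shown below. The module and symbol names (`cstmodels`, `CST15`, the `pretrained` flag) and the input/output shape conventions are assumptions for illustration, not the documented API; please refer to the package documentation for the actual interface.

```python
# Illustrative sketch -- names and shapes here are assumptions,
# not the documented API; check the package README before use.
import numpy as np
from cstmodels import CST15  # assumed entry point for the pre-trained model

# Load the reference CST-15 model with its ImageNet weights.
model = CST15(pretrained=True)

# A set of 5 RGB images at 224x224; the CST contextualizes each image
# with the rest of the set before classifying it.
image_set = np.random.rand(5, 224, 224, 3).astype("float32")

# One ImageNet prediction per image in the set (assumed output shape).
probs = model.predict(image_set)
print(probs.shape)  # expected: (5, 1000)
```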
Set Anomaly Detection is a binary classification task that aims to identify the images in a set that are anomalous, i.e. inconsistent with the majority of the set. Here, the notion of anomaly is relative: the same image may be considered anomalous in one set but not in another, depending on the surrounding context. The Figure below shows two image sets derived from the CelebA dataset (Liu et al., 2015). In each set, a majority of normal images share two attributes ("wearing hat" and "smiling" in the first, "no beard" and "attractive" in the second), while a minority lack these attributes and are thus anomalous. After training a CST and a Set Transformer (Lee et al., 2019) on CelebA for Set Anomaly Detection, we evaluate the explainability of their predictions by overlaying Grad-CAMs on the anomalous images. CST explanations correctly highlight the anomalous regions, whereas ST explanations fail to provide meaningful insights. For more details, see Chinello & Boracchi (2025).
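To make the task setup concrete, here is a small sketch (not the authors' pipeline) that assembles one such set from CelebA-style binary attribute annotations. The function name, array shapes, and sampling scheme are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_anomaly_set(images, attrs, attr_ids, n_normal=8, n_anomalous=2):
    """Sample one set: `n_normal` images having both attributes in `attr_ids`
    (e.g. "wearing hat" and "smiling") plus `n_anomalous` images with neither.
    Returns the images and per-image binary labels (1 = anomalous)."""
    attrs = attrs.astype(bool)
    both = attrs[:, attr_ids].all(axis=1)       # candidates for "normal"
    neither = ~attrs[:, attr_ids].any(axis=1)   # candidates for "anomalous"
    idx = np.concatenate([
        rng.choice(np.flatnonzero(both), n_normal, replace=False),
        rng.choice(np.flatnonzero(neither), n_anomalous, replace=False),
    ])
    rng.shuffle(idx)  # anomalies can appear anywhere in the set
    return images[idx], neither[idx].astype(np.int64)
```

Because the labels depend on which attributes the surrounding majority shares, the same image can receive label 0 in one sampled set and label 1 in another, which is exactly the relative notion of anomaly described above.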
The Figure below presents a qualitative comparison of Grad-CAMs generated for our CST-15 model (28M params), ConvNeXt-XL (350M params), ResNet50 (26M params), and VGG-19 (144M params). To ensure a fair comparison, we feed CST-15 isolated images without any set context. Explanations are computed with respect to the ground-truth class. CST-15 Grad-CAMs are more precise and focused than those of ResNet50 and VGG-19, and comparable to, or even better than, those of ConvNeXt-XL. Consider, for instance, the first image (first row in the Figure below), which depicts a space shuttle being transported by a shuttle carrier aircraft. Even to the human eye, it is difficult to distinguish the shuttle from the carrier. The CST-15 explanation map nevertheless accurately localizes the space shuttle, separating it from the aircraft, whereas the Grad-CAMs generated for the other models are significantly less precise, highlighting a coarse region that encompasses both the shuttle and the carrier. For more details, see Chinello & Boracchi (2025).
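The explanation maps above are standard Grad-CAMs (Selvaraju et al., 2017). For reference, a generic Keras implementation of the technique looks roughly like the sketch below; this is not the authors' exact code, the target layer name applies to the stock ResNet50, and the input is a random stand-in (a real image would also need `preprocess_input`).

```python
# Generic Grad-CAM sketch for a Keras CNN (Selvaraju et al., 2017).
import numpy as np
import tensorflow as tf
from tensorflow import keras

def grad_cam(model, image, conv_layer_name, class_index):
    """Return a heatmap in [0, 1] for `class_index` on a single image."""
    # Sub-model mapping the input to (last conv activations, predictions).
    grad_model = keras.Model(
        model.inputs,
        [model.get_layer(conv_layer_name).output, model.output],
    )
    with tf.GradientTape() as tape:
        conv_out, preds = grad_model(image[np.newaxis])
        score = preds[:, class_index]
    grads = tape.gradient(score, conv_out)         # d(score)/d(activations)
    weights = tf.reduce_mean(grads, axis=(1, 2))   # global-average-pool grads
    cam = tf.einsum("bhwc,bc->bhw", conv_out, weights)[0]
    cam = tf.nn.relu(cam)                          # keep positive evidence only
    return (cam / (tf.reduce_max(cam) + 1e-8)).numpy()

# Example with a stock ImageNet model and its last conv block.
model = keras.applications.ResNet50(weights="imagenet")
img = np.random.rand(224, 224, 3).astype("float32")  # stand-in for a real image
heatmap = grad_cam(model, img, "conv5_block3_out",
                   class_index=812)  # ImageNet class 812: space shuttle
```

The heatmap is then upsampled to the input resolution and overlaid on the image, as in the Figure above.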