Welcome to the official page of the Picariello Data
Challenge 2022 organized by
the University of Naples "Federico II" and sponsored
by
Eustema s.p.a.
The Challenge will be
open to groups of students coming from all departments of Federico II.
Contacts:
Dr. Donato Cappetta (EUSTEMA)
Prof. Giuseppe Longo
(Dip. di Fisica "Ettore Pancini")
Prof. Carlo Sansone
(Dip. di Ing. Elettrica e delle Tecnologie dell'Informazione)
If you want to know more about the challenge press → on your keyboard.
The objective of the challenge is to simulate
an industrial Data Science project as you would encounter
working as a data scientist for a company or into an interdisciplinary
research group. The challenge is about predicting the outcome,
the duration in number of days and the settlement for
around 300,000
lawsuits produced by several tribunals of the "Giudice di Pace" scattered
around Italy.
In order to register to the challenge and to see the timeline take a look
at Registration and Timeline
To know more about the data, take a look at Data
To know more about the scoring, take a look at Scoring
To know how to submit your results, take a look at Submission
At the beginning of the challenge, each group will be provided with training sets containing all the input data fields and the values for the target variables and blind test sets for which you will have to make predictions. Given that we are simulating an industrial data science project, we are not only interested about the performances of your algorithms in predicting the target variables, but also on the completeness and correctness of your Exploratory Data Analysis (EDA) and the quality of both your code and report explaining your strategy to solve the problems and the details of the algorithms and method used.
Members of both groups will have the possibility of doing an internship/thesis in Eustema (hiring path)
In order to register to the challenge, you need to form a group of 3 - 4 students (the composition of the groups cannot change during the challenge) and send an email to infodatascience@unina.it containing the following informations:
You will receive a reply containg the code to join the Challenge Team on Microsoft Teams. The Team is where the challenge will be presented, where announcements will be made, where you will find the data (when the challenge begins) and where you will have to submit your deliverables.
The data will be provided on Microsoft Teams at the beginning of the challenge and will be consituted by 4 .csv files:
The following data fields are present:
Soon after your group registration, and before the challenge begins, a folder accessible only to the members of your group will be created in the Files folder of the Challenge Team. In order to complete the challenge your group will have to upload the following files before the 16th of September at midnight:
THe challenge will be evaluated by a committee chaired by Prof. Roberta Siciliano and including faculty members of UNINA and Data Scientists from EUSTEMA. The scoring of the challenge is based on a point system in which the minimum score is 0 and the maximum is 100. The total number of points is divided among the three tasks of the challenge plus the final presentation as it follows:
The following metrics will be used to evaluate your predictions:
The presentations of the finalist groups will be judged by the commission and up to 20 points will be awarded on the basis of the ability of the groups to convince the commission and sell their solution.