Connecting Science to Innovation: Open Data and Machine Learning Approaches
Matt Marx
Cornell University, USA, and NBER
Abstract:
Prof. Marx will discuss advances in open data, machine learning and their role in innovation research. He will expand on one of his latest projects, which systematically links scientific publications to technological outputs. In this project, the authors provide a dataset of Patent–Paper Pairs (PPPs) across all fields of science, identifying instances where authors of scientific papers also exploit their discoveries in patented inventions. To do so, they train a random forest model based on a combination of hand-checked PPPs. The dataset is then used to revisit the perennial question of whether the patent system fulfills its objective to “promote the progress of science”. Prof. Marx will then conclude by reflecting on how machine learning, and in particular Large Language Models (LLMs), can be leveraged to open new opportunities for large-scale, data-intensive research on science, technology, and innovation.
A discussion will follow by Gianluca Tarrasconi, Chief Data Officer at ipQuants AG, who will offer his perspective on the potential of patent data for innovation analysis and share insights from his startup experience applying LLMs to patent databases to develop specialized reporting and analytics services.
Matt Marx is the Bruce F. Failing, Sr. Chair in Entrepreneurship at the Cornell SC Johnson College of Business, and is the inaugural Faculty Director of Entrepreneurship@Cornell. He leads the Innovation Information Initiative (I3), curates several open datasets at relianceonscience.org, serves as Department Editor for Innovation & Entrepreneurship at Management Science, and is a Research Associate at the National Bureau of Economic Research (NBER). Matt was previously an executive and inventor at two successful startup companies and holds six patents.
Data
15:00 - 16:30
Luogo
Building BL26 – Rooms 0.18 and 0.19 (ground floor) Department of Management, Economics and Industrial Engineering Via R. Lambruschini, 4/BOrganizzatore
Politecnico di MilanoEventi
11 →
novembre
11
dicembre 2025
LABORATORI POLITECNICI dipartimenti | territorio | didattica | tecnologia
01
dicembre 2025
Il profitto e la cura. La sostenibilità e le voci che non abbiamo ascoltato
02
dicembre 2025
Digital & Open Innovation 2026: cosa serve a imprese e startup per un cambio di passo
02 →
dicembre
03
dicembre 2025
Per il grande pubblico. L’architettura tra dibattito, divulgazione e strategie narrative
04
dicembre 2025
Omnichannel Customer Experience: si accelera con l’AI ma senza fondamenta solide
10
dicembre 2025
Digital readiness and skills: a portrait of Europe’s large and super-large enterprises
11
dicembre 2025
Convegno dei risultati di Ricerca dell’Osservatorio Fintech & Insurtech
11
dicembre 2025
CAI-Polimi per CORTALP: il festival / concorso del cortometraggio di Montagna Milano con il Premio "Manlio Armellini"
15
dicembre 2025
Echoes of Discord: The Effects of Global Conflicts and Disputes on Entrepreneurship
18
dicembre 2025