Search Results for author: Michele Merler

Found 8 papers, 0 papers with code

A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models

no code implementations13 Oct 2023 Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee

Our target of study includes Output Distribution (OD) transfer, Hidden State (HS) transfer with various layer mapping strategies, and Multi-Head Attention (MHA) transfer based on MiniLMv2.

Knowledge Distillation

NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search

no code implementations23 Jun 2020 Rameswar Panda, Michele Merler, Mayoore Jaiswal, Hui Wu, Kandan Ramakrishnan, Ulrich Finkler, Chun-Fu Chen, Minsik Cho, David Kung, Rogerio Feris, Bishwaranjan Bhattacharjee

The typical way of conducting large scale NAS is to search for an architectural building block on a small dataset (either using a proxy set from the large dataset or a completely different small scale dataset) and then transfer the block to a larger dataset.

Neural Architecture Search

Covering the News with (AI) Style

no code implementations5 Jan 2020 Michele Merler, Cicero Nogueira dos santos, Mauro Martino, Alfio M. Gliozzo, John R. Smith

We introduce a multi-modal discriminative and generative frame-work capable of assisting humans in producing visual content re-lated to a given theme, starting from a collection of documents(textual, visual, or both).

Automatic Curation of Golf Highlights using Multimodal Excitement Features

no code implementations22 Jul 2017 Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogerio S. Feris

The production of sports highlight packages summarizing a game's most exciting moments is an essential task for broadcast media.

Action Recognition Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.