The exponential growth in Large Language Model
(LLM) deployment has intensified the need for efficient model
compression techniques to reduce computational costs and memory requirements. While pruning and quantization have shown
promising results, their combined potential remains largely
unexplored. In this paper, we examine joint compression, investigating how
a strategic combination of pruning and quantization can
achieve superior compression-to-performance ratios compared
to either technique applied alone. Recognizing the challenges
in accurately assessing LLM performance, we address key
limitations of previous evaluation frameworks and introduce the
Semantic Retention Compression Rate (SrCr), a novel metric that
quantifies the trade-off between model compression and semantic
preservation, facilitating optimization of pruning-quantization
configurations. Experiments demonstrate that our recommended
combination achieves, on average, a 20% performance increase
over an equivalent quantization-only model at the same
theoretical compression ratio.