CAVC: Cosine Attention Video Colorization

Leandro Stival, Ricardo Silva da Torres, Helio Pedrini

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › Academic › peer-review

Abstract

Video colorization is a challenging task that requires deep learning models to employ diverse abstractions in order to grasp the problem comprehensively and produce high-quality results. Currently, among example-based colorization approaches, combining attention processes with convolutional layers has proven to be the most effective way to produce good results. Following this direction, in this paper we propose Cosine Attention Video Colorization (CAVC), an approach that uses a single attention head with shared weights to produce a refinement of the monochromatic frame, as well as the cosine similarity between this sample and the other channels present in the image. This entire process acts as a pre-processing step for our autoencoder, which performs a feature fusion with the latent space extracted from the reference frame, as well as with its histogram. This architecture was trained on the DAVIS, UVO and LDV datasets and achieved superior results compared to state-of-the-art models in terms of the FID metric on all of the datasets.
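The abstract does not spell out how the cosine similarity is computed, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes the refined monochromatic frame is an (H, W) array and the "other channels" are stacked in an (H, W, K) array; the function names `cosine_similarity` and `similarity_to_channels`, the flattening of each map into a vector, and the `eps` stabiliser are all illustrative assumptions.

```python
import numpy as np


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between two arrays, each treated as a flat vector.

    eps is a hypothetical stabiliser that avoids division by zero
    when one of the inputs is all zeros.
    """
    a = np.asarray(a, dtype=np.float64).ravel()
    b = np.asarray(b, dtype=np.float64).ravel()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps))


def similarity_to_channels(refined, channels):
    """Compare a refined (H, W) map with each channel of an (H, W, K) array.

    Returns a list of K similarity scores, one per channel.
    """
    return [
        cosine_similarity(refined, channels[..., k])
        for k in range(channels.shape[-1])
    ]


# Toy data (assumed shapes): one refined map and three candidate channels.
rng = np.random.default_rng(0)
refined = rng.random((4, 4))
channels = np.stack(
    [refined * 2.0, -refined, rng.random((4, 4))], axis=-1
)
scores = similarity_to_channels(refined, channels)
```

In this toy setup, a channel that is a positive multiple of the refined map scores close to 1, its negation scores close to -1, and an unrelated channel falls somewhere in between. How CAVC actually feeds such similarities into the autoencoder is described in the paper itself.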

Original language: English
Title of host publication: Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Subtitle of host publication: Volume 3: VISAPP
Editors: L. Stival, R. Torres, H. Pedrini
Publisher: SciTePress
Pages: 385-392
Number of pages: 8
Volume: 3
ISBN (Print): 9789897586798
DOIs
Publication status: Published - 2024
Event: 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2024 - Rome, Italy
Duration: 27 Feb 2024 to 29 Feb 2024

Publication series

Name: Proceedings of the International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
ISSN (Print): 2184-5921

Conference

Conference: 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2024
Country/Territory: Italy
City: Rome
Period: 27/02/24 to 29/02/24

Keywords

  • Attention Mechanism
  • Cosine Similarity
  • Deep Learning
  • Video Colorization
