面向机器视觉任务的多尺度图像特征压缩算法

Translated title of the contribution: Image Multi-Scale Feature Compression Algorithm for Machine Vision Tasks

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Fortheproblemsthat inthecollaborativeintelligenceframework theintermediate featuredataof machinevisiontasksislargeanddifficulttotransmitefficiently,a multi-scale imagefeaturefusioncompressionalgorithm wasproposed.Firstly acascadedresidualtransformationmodule wasdesignedaccordingtothe multi-scalefeaturesoutputbythedeeplearning modelontheedgedevice theredundancyofmulti-scalefeatureswaseliminatedbystepwisesubtractionoffeaturesofdifferentsizes andtheresidualfeatureswerecompressedtoaunifiedsize. Then anautoencoderwasdesignedtoeliminatethestatisticalredundancyofcompactfeaturesby arithmeticcoding.Next a prediction andreconstruction module was designed onthecloud accordingtothecompactfeaturesofdecodingtogeneratethepredictionfeatures which were combinedwiththeresidualfeaturestoaccuratelyreconstructthemulti-scalefeatures.Finally,a jointoptimizationfunction wasbuiltforthecollaborativeoptimizationofthe modulesincluding residualtransformation autoencoder andpredictionreconstruction thusachievingtheoptimal trade-offbetweentransmissionbitrateandinformationrepresentationability.Thesimulation resultsshowthattheproposedalgorithmhasnotonlythelargestfeaturecompressionratio but alsothemostcompletereconstructedfeaturesinthespacecompression andthatwhenthetransmissionbitrateis0.1bpp themodelaccuracyoftheproposedalgorithmisimprovedby8.57% and3.87% respectively comparedwiththeimagecodingalgorithm VVCandthefeaturecompressionalgorithm MSFC.Thisstudycanprovidetechnicalsupportforthecodingframeworkof machinevision andhascertainvalueinengineeringapplication.

Translated title of the contributionImage Multi-Scale Feature Compression Algorithm for Machine Vision Tasks
Original languageChinese (Traditional)
Pages (from-to)1-10
Number of pages10
JournalHsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University
Volume57
Issue number12
DOIs
StatePublished - Dec 2023

Fingerprint

Dive into the research topics of 'Image Multi-Scale Feature Compression Algorithm for Machine Vision Tasks'. Together they form a unique fingerprint.

Cite this