SCIENTIA SINICA Informationis, Volume 48 , Issue 8 : 1076-1082(2018) https://doi.org/10.1360/N112018-00025

Digital retina: revolutionizing camera systems for the smart city

More info
  • Received: Jan 31, 2018
  • Accepted: Mar 3, 2018
  • Published: May 21, 2018



  • Figure 1

    (Color online) The effect of video compression on different analysis and retrieval tasks: (a) visual search, (b) face recognition, and (c) person re-identification. In the experiments, we selected one benchmark dataset for each task and used the state-of-the-art AVS2 codec to reconstruct images and videos at different quantization parameters (QPs). The reconstructed images and videos were then used to evaluate performance on each task.

  • Figure 2

    (Color online) The compound-eye camera system for the smart city, formed by connecting a large number of digital retinas. In this system, feature streams can be aggregated into the city brain in real time, while video streams are saved in local storage and pulled to the city brain only on demand.
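
The data flow described in the Figure 2 caption can be illustrated with a minimal Python sketch. It is a toy model only, not the system from the paper: the class names `DigitalRetina` and `CityBrain`, their methods, and the in-memory dictionaries standing in for local storage and the network are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class DigitalRetina:
    """Hypothetical camera node: keeps video locally, emits compact features."""
    camera_id: str
    # Stand-in for local storage: frame index -> encoded video bytes.
    local_video: dict = field(default_factory=dict)

    def capture(self, frame_idx: int, video_chunk: bytes, feature: bytes) -> tuple:
        # The video stream stays in local storage.
        self.local_video[frame_idx] = video_chunk
        # Only the (much smaller) feature record is sent upstream.
        return (self.camera_id, frame_idx, feature)

    def pull_video(self, frame_idx: int) -> bytes:
        # Serve stored video when it is explicitly requested.
        return self.local_video[frame_idx]


class CityBrain:
    """Hypothetical aggregator: collects feature streams, pulls video on demand."""

    def __init__(self) -> None:
        self.cameras = {}   # camera_id -> DigitalRetina
        self.features = []  # aggregated feature records

    def register(self, retina: DigitalRetina) -> None:
        self.cameras[retina.camera_id] = retina

    def ingest(self, record: tuple) -> None:
        # Feature streams are aggregated as they arrive.
        self.features.append(record)

    def request_video(self, camera_id: str, frame_idx: int) -> bytes:
        # Video is fetched from the camera's local storage only when needed.
        return self.cameras[camera_id].pull_video(frame_idx)


brain = CityBrain()
cam = DigitalRetina("cam-1")
brain.register(cam)
brain.ingest(cam.capture(0, b"video-bytes", b"feat"))
clip = brain.request_video("cam-1", 0)
```

In this sketch the city brain holds only feature records until `request_video` is called, which mirrors the push-features, pull-video split in the caption.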

