1 / 28

Predicting User Preferences for Photo Squarization using Deep Neural Networks

This paper introduces a new approach to understanding human perception preferences for photo squarization. It proposes a two-module deep neural network that learns user preferences and predicts retargeting operations for photo squarization. The approach is evaluated on a new dataset with annotations indicating user preferences. The results are compared with state-of-the-art methods, demonstrating the effectiveness of the proposed approach.

bernicej
Download Presentation

Predicting User Preferences for Photo Squarization using Deep Neural Networks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CGW2019

  2. Computer Graphics Group of Visual System Lab at Dept. of Computer Science and Information Engineering, National Cheng – Kung University Visit us at: http://graphics.csie.ncku.edu.tw

  3. Outline

  4. 1. Introduction

  5. 1. Introduction Standard operation is cropping

  6. 1. Introduction Content-aware image retargeting (CAIR) methods

  7. 1. Introduction Content-aware image retargeting (CAIR) methods Most methods are developed by following one single scheme, thereby leading to adaptability problem on general displayinguse for social media

  8. 1. Introduction Multi – operator methods CAIR Seam carving Cropping Scaling Contents + compositions

  9. 1. Introduction Acquiring user preference information for retargeting a photo to square Challenges Understand human behavior in balancing three issues: salient content, object shape, scene composition Square shape Predicting squarization strategy that is consistent with human visual perception Three mentioned issues: Retargeting Visual Perception Issues (RVPIs)

  10. 1. Introduction Acquiring user preference information for retargeting a photo to square Challenges RVPIs Build a synergy Square shape CAIR Predicting squarization strategy that is consistent with human visual perception

  11. 2. Dataset

  12. 2. Dataset2.1. Perception Formulation Scene composition Object shape Salient content Retargeting Visual Perception Issues (RVPIs) balance Given a photo define [, , ] as the operation allocation performed on one dimension of I to obtain a square image

  13. 2. Dataset2.2. Data Collection 5.084 images collected on the basic of 3 principles: 3. Operability Duplicate images and images with too large or too small aspect ratio  picked out 2. Comprehensive Images from “RetargetMe” Popularity Collect from Flickr, Pinterest, Pexels Each participant square all 5.084 images by allocating the three operators

  14. 3. Our approach

  15. 3. Approach

  16. 3. Approach3.1. Problem Formulation Square Size (t, t); t = min(w, h) Size (w, h) Learning the mapping function Where + Learning goal

  17. 3. Approach3.1. Problem Formulation

  18. 3. Approach3.2. Learning approach Perception module Learn the relationship among the three kinds of operations Distillation module Soft target output Mapping the original image and perceptive value

  19. 3. Approach3.2. Learning approach output

  20. 3. Approach3.2. Learning approach Distillation module Transfer the output of the perception module (to distillation module. Output of perception module Loss functions: + () Learn the transferring The offset between input label and output Input image

  21. 4. Results

  22. Comparison of our results with those of other state-of-the-art methods for transverse images

  23. Comparison of our results with those of other state-of-the-art methods for longitudinal images

  24. Conclusion • Our work makes the following technical contributions: 1. A new way to observe human perception preferences among RVPIs during photo squarizationprocess. 2. A new dataset with annotations that indicates user preferences toward photo squarization problem 3. A new two-module deep neural network for learning user preferences and predicting retargeting operations for photo squarization.

  25. Thanks for your listening

  26. Appendix

More Related