{"id":3883,"date":"2015-07-09T14:35:24","date_gmt":"2015-07-09T18:35:24","guid":{"rendered":"https:\/\/www.bu.edu\/smartlighting\/?page_id=3883"},"modified":"2015-07-09T14:36:09","modified_gmt":"2015-07-09T18:36:09","slug":"privacy-preserving-smart-room-analytics","status":"publish","type":"page","link":"https:\/\/www.bu.edu\/smartlighting\/privacy-preserving-smart-room-analytics\/","title":{"rendered":"Privacy Preserving Smart-Room Analytics"},"content":{"rendered":"<p><strong>Team<\/strong>: J. Dai, J.Wu, B. Saghafi, J. Konrad, P. Ishwar<br \/>\n<strong>Funding<\/strong>: This material is based on work supported by the US National Science Foundation under <a href=\"https:\/\/smartlighting.rpi.edu\/\" target=\"_blank\">Smart Lighting ERC<\/a> Cooperative Agreement No. EEC-0812056<br \/>\n<strong>Status<\/strong>: Ongoing (2014-\u2026)<\/p>\n<p><strong>Summary:<\/strong> Although extensive research on action recognition has been carried out using standard video cameras, little work has explored recognition performance at extremely low temporal or spatial camera resolutions. Reliable action recognition in such a \u201cdegraded\u201d environment would promote the development of privacy-preserving smart rooms that would facilitate intelligent interaction with its occupants while mitigating privacy concerns. This project aims to explore the trade-off between action recognition performance, number of cameras, and temporal and spatial resolution in a smart-room environment.<\/p>\n<p><center><\/center><a href=\"http:\/\/vip.bu.edu\/files\/2015\/05\/smartroom.png\"><img src=\"http:\/\/vip.bu.edu\/files\/2015\/05\/smartroom.png\" alt=\"A seminar room simulated in Unity3D with 5 ceiling-mounted cameras in a pentagonal arrangement.\" width=\"360\" class=\"size-full wp-image-3105\" \/><\/a><\/p>\n<p>As it is impractical to build a physical platform to test every combination of camera positions and resolutions, we use a graphics engine (Unity3D) to simulate a room with various avatars animated using motions captured from real subjects with a Kinect v2 sensor.<\/p>\n<p><center><\/center><a href=\"http:\/\/vip.bu.edu\/files\/2015\/05\/avatars.png\"><img src=\"http:\/\/vip.bu.edu\/files\/2015\/05\/avatars.png\" alt=\"Snapshots from various camera viewpoints of an avatar raising his arm. \" width=\"460\" class=\"size-full wp-image-3106\" \/><\/a><\/p>\n<p>We study the performance impact of spatial resolutions from a single pixel up to 10\u00d710 pixels (an extremely low spatial resolution), the impact of temporal resolutions from 2 Hz up to 30 Hz and the impact of using up to 5 ceiling cameras.<\/p>\n<p><center><\/center><a href=\"http:\/\/vip.bu.edu\/files\/2015\/05\/resolution.png\"><img src=\"http:\/\/vip.bu.edu\/files\/2015\/05\/resolution.png\" alt=\"Extremely low spatial resolution snapshots\" width=\"460\" class=\"size-full wp-image-3115\" \/><\/a><\/p>\n<p>Results of this study indicate that reliable action recognition for smart-room centric gestures is possible in environments with extremely low temporal and spatial resolutions. An overview of these results is shown in the table below:<\/p>\n<p><center><a href=\"http:\/\/vip.bu.edu\/files\/2015\/05\/results.png\"><img loading=\"lazy\" src=\"http:\/\/vip.bu.edu\/files\/2015\/05\/results.png\" alt=\"Results\" width=\"531\" height=\"179\" class=\"size-full wp-image-3124\" \/><\/a><\/center><span>When using 5, single-pixel cameras at 30Hz we can achieve a correct classification rate (CCR) of 75.70% across 9 actions which is only 13.9% lower than the CCR for the same camera set up at 10\u00d710 pixels. In terms of the impact on action recognition performance, we found that spatial resolution has the highest impact, followed by number of cameras, and finally temporal resolution (frame rate).<\/span><\/p>\n<p><strong>Additional resources on this project will be added soon.<\/strong><\/p>\n<p>For a more in-depth explanation of our methodology and the aforementioned results please refer to our paper below.<\/p>\n<p><strong>Publications:<\/strong><\/p>\n<ol>\n<li><b>J. Dai, J. Wu, B. Saghafi, J. Konrad, and P. Ishwar<span>, <\/span><\/b><span>\u201cTowards Privacy-Preserving Activity Recognition Using Extremely Low Temporal and Spatial Resolution Cameras,\u201d in <\/span><span><em>Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Workshop on Analysis and Modeling of Faces and Gestures (AMFG)<\/em>, June. 2015.<\/span><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Team: J. Dai, J.Wu, B. Saghafi, J. Konrad, P. Ishwar Funding: This material is based on work supported by the US National Science Foundation under Smart Lighting ERC Cooperative Agreement No. EEC-0812056 Status: Ongoing (2014-\u2026) Summary: Although extensive research on action recognition has been carried out using standard video cameras, little work has explored recognition [&hellip;]<\/p>\n","protected":false},"author":2469,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/pages\/3883"}],"collection":[{"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/users\/2469"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/comments?post=3883"}],"version-history":[{"count":2,"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/pages\/3883\/revisions"}],"predecessor-version":[{"id":3885,"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/pages\/3883\/revisions\/3885"}],"wp:attachment":[{"href":"https:\/\/www.bu.edu\/smartlighting\/wp-json\/wp\/v2\/media?parent=3883"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}