{"publication_identifier":{"issn":["2663-337X"],"isbn":["978-3-99078-007-7"]},"page":"197","doi":"10.15479/AT:ISTA:8390","has_accepted_license":"1","author":[{"orcid":"0000-0002-8407-0705","id":"3811D890-F248-11E8-B48F-1D18A9856A87","full_name":"Royer, Amélie","first_name":"Amélie","last_name":"Royer"}],"language":[{"iso":"eng"}],"_id":"8390","citation":{"ieee":"A. Royer, “Leveraging structure in Computer Vision tasks for flexible Deep Learning models,” Institute of Science and Technology Austria, 2020.","mla":"Royer, Amélie. Leveraging Structure in Computer Vision Tasks for Flexible Deep Learning Models. Institute of Science and Technology Austria, 2020, doi:10.15479/AT:ISTA:8390.","short":"A. Royer, Leveraging Structure in Computer Vision Tasks for Flexible Deep Learning Models, Institute of Science and Technology Austria, 2020.","ista":"Royer A. 2020. Leveraging structure in Computer Vision tasks for flexible Deep Learning models. Institute of Science and Technology Austria.","chicago":"Royer, Amélie. “Leveraging Structure in Computer Vision Tasks for Flexible Deep Learning Models.” Institute of Science and Technology Austria, 2020. https://doi.org/10.15479/AT:ISTA:8390.","apa":"Royer, A. (2020). Leveraging structure in Computer Vision tasks for flexible Deep Learning models. Institute of Science and Technology Austria. https://doi.org/10.15479/AT:ISTA:8390","ama":"Royer A. Leveraging structure in Computer Vision tasks for flexible Deep Learning models. 2020. doi:10.15479/AT:ISTA:8390"},"acknowledged_ssus":[{"_id":"CampIT"},{"_id":"ScienComp"}],"year":"2020","ddc":["000"],"acknowledgement":"Last but not least, I would like to acknowledge the support of the IST IT and scientific computing team for helping provide a great work environment.","file_date_updated":"2020-09-14T13:39:17Z","tmp":{"legal_code_url":"https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode","short":"CC BY-NC-SA (4.0)","image":"/images/cc_by_nc_sa.png","name":"Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)"},"date_updated":"2023-10-16T10:04:02Z","supervisor":[{"orcid":"0000-0001-8622-7887","full_name":"Lampert, Christoph","id":"40C20FD2-F248-11E8-B48F-1D18A9856A87","first_name":"Christoph","last_name":"Lampert"}],"article_processing_charge":"No","month":"09","day":"14","degree_awarded":"PhD","file":[{"success":1,"file_id":"8391","file_size":30224591,"date_created":"2020-09-14T13:39:14Z","date_updated":"2020-09-14T13:39:14Z","content_type":"application/pdf","creator":"dernst","access_level":"open_access","file_name":"2020_Thesis_Royer.pdf","checksum":"c914d2f88846032f3d8507734861b6ee","relation":"main_file"},{"file_id":"8392","file_size":74227627,"date_created":"2020-09-14T13:39:17Z","date_updated":"2020-09-14T13:39:17Z","content_type":"application/x-zip-compressed","file_name":"thesis_sources.zip","access_level":"closed","creator":"dernst","relation":"main_file","checksum":"ae98fb35d912cff84a89035ae5794d3c"}],"license":"https://creativecommons.org/licenses/by-nc-sa/4.0/","type":"dissertation","abstract":[{"text":"Deep neural networks have established a new standard for data-dependent feature extraction pipelines in the Computer Vision literature. Despite their remarkable performance in the standard supervised learning scenario, i.e. when models are trained with labeled data and tested on samples that follow a similar distribution, neural networks have been shown to struggle with more advanced generalization abilities, such as transferring knowledge across visually different domains, or generalizing to new unseen combinations of known concepts. In this thesis we argue that, in contrast to the usual black-box behavior of neural networks, leveraging more structured internal representations is a promising direction\r\nfor tackling such problems. In particular, we focus on two forms of structure. First, we tackle modularity: We show that (i) compositional architectures are a natural tool for modeling reasoning tasks, in that they efficiently capture their combinatorial nature, which is key for generalizing beyond the compositions seen during training. We investigate how to to learn such models, both formally and experimentally, for the task of abstract visual reasoning. Then, we show that (ii) in some settings, modularity allows us to efficiently break down complex tasks into smaller, easier, modules, thereby improving computational efficiency; We study this behavior in the context of generative models for colorization, as well as for small objects detection. Secondly, we investigate the inherently layered structure of representations learned by neural networks, and analyze its role in the context of transfer learning and domain adaptation across visually\r\ndissimilar domains. ","lang":"eng"}],"department":[{"_id":"ChLa"}],"publication_status":"published","date_published":"2020-09-14T00:00:00Z","publisher":"Institute of Science and Technology Austria","user_id":"c635000d-4b10-11ee-a964-aac5a93f6ac1","date_created":"2020-09-14T13:42:09Z","oa":1,"oa_version":"Published Version","alternative_title":["ISTA Thesis"],"related_material":{"record":[{"status":"public","relation":"part_of_dissertation","id":"7936"},{"status":"public","id":"7937","relation":"part_of_dissertation"},{"status":"public","relation":"part_of_dissertation","id":"8193"},{"status":"public","relation":"part_of_dissertation","id":"8092"},{"status":"public","relation":"part_of_dissertation","id":"911"}]},"status":"public","title":"Leveraging structure in Computer Vision tasks for flexible Deep Learning models"}