    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/134198


    Title: 運用人臉辨識技術於歷史照片之分析
    Application of Deep Face Recognition Techniques to the Analysis of Historical Photos
    Authors: 林琬儒
    Lin, Wan-Ju
    Contributors: 廖文宏
    林琬儒
    Lin, Wan-Ju
    Keywords: 人臉偵測
    深度學習
    人臉識別
    歷史照片
    Face detection
    Face recognition
    Deep learning
    Historical photos
    Date: 2021
    Issue Date: 2021-03-02 14:55:43 (UTC+8)
    Abstract: 國內外文史單位,蒐集歷史老照片並致力於檔案數位化,然而這些照片尚有許多資訊內容,例如人、事、時、地、物,須當事人或其家屬、親友等,協助辨識確認。相關人士或旅居海外,或年事已高,因此需要建置友善操作介面的網站,讓這些目擊歷史事件的耆老們提供寶貴記憶,為珍貴的史料記錄其來龍去脈。
    原本上述須完全倚靠人工辨識、描述的資訊內容之作業,是否能採用電腦視覺技術予以協助、加速?我們在前述提及之數位典藏歷史相簿標記網站中,除了提供上傳照片以及建置描述資料(metadata)功能外,也加入人臉識別推薦功能輔助識別照片中的人物。最後,對打字不熟悉的長輩,亦可透過錄音的方式記錄和老照片之相關資訊,此音檔亦可視為一種珍貴的口述歷史保存之標的。
    本論文建置基於蒐集歷史圖像為主要資料集的網站,並應用電腦視覺技術,開發從人臉偵測(face detection)到人臉識別(face recognition)的端對端(end-to-end)流程,盼本研究之貢獻能造福有文史圖片分析需求之典藏單位。
    Cultural and historical institutions collect and digitize historical photos for archiving purposes. However, much of the information about these photos, including the people, events, times, places, and objects they depict, still needs to be identified and confirmed by those involved or by their family and friends. These people may live overseas or be advanced in age. It is thus beneficial to build a website with a friendly user interface, so that the elderly who witnessed historical events can share their valuable memories and document the context of these precious historical materials.
    Computer vision technology can assist and accelerate these tasks, which have previously relied solely on human identification and description. In the historical album tagging website described above, in addition to basic functions such as uploading photos and adding metadata, we also implement a face recognition recommendation function to assist in identifying the people in photos. Elderly users who are unfamiliar with typing can instead record information related to the photos by voice. These audio files can themselves be preserved as valuable oral history.
    This thesis builds a website centered on a collection of historical images and applies computer vision technology to develop an end-to-end process from face detection to face recognition. We hope that this research will benefit archival institutions that need to analyze cultural and historical pictures.
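    The record does not describe how the face recognition recommendation is computed. As a minimal sketch only, one common approach is to compare a fixed-length embedding of the detected face against embeddings of already-labeled faces and suggest the closest matches. The function names, the three-dimensional toy vectors, and the use of cosine similarity below are illustrative assumptions, not the thesis's confirmed method.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend_identities(query, gallery, top_k=3):
    """Rank labeled gallery embeddings by similarity to the query embedding.

    `gallery` maps a (hypothetical) person label to that person's embedding.
    Returns up to `top_k` (label, score) pairs, best match first.
    """
    scored = [(name, cosine_similarity(query, emb)) for name, emb in gallery.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy data: real face embeddings typically have hundreds of dimensions.
gallery = {
    "person_a": [1.0, 0.0, 0.0],
    "person_b": [0.0, 1.0, 0.0],
    "person_c": [0.7, 0.7, 0.0],
}
query = [0.9, 0.1, 0.0]

suggestions = recommend_identities(query, gallery, top_k=2)
for name, score in suggestions:
    print(f"{name}: {score:.3f}")
```

    In a tagging website of this kind, such a ranked list would be shown as suggestions for a human to confirm or reject, rather than applied as an automatic label.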
    Reference: [1] 林素甘, 楊美華, & 柯皓仁. (2008). 數位化發展對檔案典藏與保存之影響.
    [2] "勝利之吻," in https://zh.wikipedia.org/wiki/%E8%83%9C%E5%88%A9%E4%B9%8B%E5%90%BB
    [3] "飢餓的蘇丹," in https://zh.wikipedia.org/wiki/%E9%A3%A2%E9%A4%93%E7%9A%84%E8%98%87%E4%B8%B9
    [4] "EXIF wiki," in http://en.wikipedia.org/wiki/Exchangeable_image_file_format.
    [5] XU, Donna, et al. Survey on multi-output learning. IEEE transactions on neural networks and learning systems, 2019.
    [6] 張婷雅 (2016),臉書相片分類及使用者樣貌分析,碩士論文,政治大學資訊科學系,臺北。
    [7] 蔡旻琪(2019)。影像分析灣裡葉姓家族的生活記憶。國立臺南大學文化與自然資源學系碩士班碩士論文,台南市。 取自https://hdl.handle.net/11296/354e8z
    [8] 周憶卿(2014)。關渡老照片的敘事:在地成長的記憶。臺北市立大學視覺藝術學系視覺藝術教學碩士學位班碩士論文,臺北市。 取自https://hdl.handle.net/11296/6dvrp3
    [9] "Image captioning with visual attention," in https://www.tensorflow.org/tutorials/text/image_captioning
    [10] "Google相簿 wiki," in https://zh.wikipedia.org/wiki/Google%E7%9B%B8%E7%B0%BF
    [11] "Google相簿說明," in https://support.google.com/photos/answer/6153599?co=GENIE.Platform%3DDesktop&hl=zh-Hant
    [12] "Google Cloud Vision API," in https://cloud.google.com/vision/
    [13] "InsightFace: 2D and 3D Face Analysis Project," in https://github.com/deepinsight/insightface
    [14] CHEN, Tianqi, et al. Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv preprint arXiv:1512.01274, 2015.
    [15] HE, Kaiming, et al. Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. p. 770-778.
    [16] CHEN, Sheng, et al. Mobilefacenets: Efficient cnns for accurate real-time face verification on mobile devices. In: Chinese Conference on Biometric Recognition. Springer, Cham, 2018. p. 428-438.
    [17] HOWARD, Andrew G., et al. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017.
    [18] SZEGEDY, Christian, et al. Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. p. 2818-2826.
    [19] PLEISS, Geoff, et al. Memory-efficient implementation of densenets. arXiv preprint arXiv:1707.06990, 2017.
    [20] DENG, Jiankang, et al. Arcface: Additive angular margin loss for deep face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019. p. 4690-4699.
    [21] DENG, Jiankang, et al. Sub-center arcface: Boosting face recognition by large-scale noisy web faces. In: European Conference on Computer Vision. Springer, Cham, 2020. p. 741-757.
    [22] "InsightFace_Pytorch," in https://github.com/TreB1eN/InsightFace_Pytorch/blob/master/README.md
    [23] ZHANG, Kaipeng, et al. Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters, 2016, 23.10: 1499-1503.
    [24] DENG, Jiankang, et al. Retinaface: Single-stage dense face localisation in the wild. arXiv preprint arXiv:1905.00641, 2019.
    [25] "Ethics in action: removing gender labels from Cloud’s Vision API," in https://diversity.google/story/ethics-in-action-removing-gender-labels-from-clouds-vision-api/
    [26] "Word error rate," in https://en.wikipedia.org/wiki/Word_error_rate
    [27] "Google cloud 語音轉文字," in https://cloud.google.com/speech-to-text?hl=zh-tw#section-12
    [28] udntvArt, "20140515《藝想世界》後代無私捐贈 羅家倫萬冊藏書落腳政大," in https://www.youtube.com/watch?app=desktop&v=N1Us-R4e3eM
    [29] 資料授權來源:國立政治大學圖書館特藏管理組(2021),臺北。
    [30] "Check orientation," in https://github.com/ternaus/check_orientation
    [31] "Open Images Dataset V6 + Extensions," in https://storage.googleapis.com/openimages/web/index.html
    [32] "Detect faces, Google Cloud文件," in https://cloud.google.com/vision/docs/detecting-faces
    [33] "定價, Google Cloud文件," in https://cloud.google.com/vision/pricing
    [34] "Face Challenges," in https://www.nist.gov/programs-projects/face-challenges
    [35] LIN, Tsung-Yi, et al. Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision. 2017. p. 2980-2988.
    [36] NAJIBI, Mahyar, et al. Ssh: Single stage headless face detector. In: Proceedings of the IEEE international conference on computer vision. 2017. p. 4875-4884.
    [37] YI, Dong, et al. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.
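    Description: Master's thesis
    國立政治大學 (National Chengchi University)
    資訊科學系碩士在職專班 (Department of Computer Science, in-service master's program)
    102971017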
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0102971017
    Data Type: thesis
    DOI: 10.6814/NCCU202100276
    Appears in Collections:[資訊科學系碩士在職專班] 學位論文

    Files in This Item:

    File    Description    Size    Format
    101701.pdf    7710Kb    Adobe PDF    2100    View/Open


    All items in 政大典藏 are protected by copyright, with all rights reserved.


