English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 109874/140825 (78%)
Visitors : 45913185      Online Users : 479
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    政大機構典藏 > 理學院 > 應用數學系 > 學位論文 >  Item 140.119/146297
    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/146297


    Title: 大英線上圖書館與倫敦大學體系線上圖書館上架編碼的數位考古
    The digit archeology about Listing code of online British Library and University of London
    Authors: 陳以洵
    Chen, Yi-Hsun
    Contributors: 曾正男
    Tzeng,Jeng-Nan
    陳以洵
    Chen, Yi-Hsun
    Keywords: 論文比對
    網路爬蟲
    錯排問題
    隨機抽樣
    排序一致性
    Z分數
    箱型圖法
    Thesis Comparison
    Web Scraping
    Derangements
    Random Sampling
    Sequential Consistency
    Z-Score
    Box Plot Method
    Date: 2023
    Issue Date: 2023-08-02 13:01:59 (UTC+8)
    Abstract: 由於申請國外學位論文證明的時間成本較高,本論文目標為利用公開網路資訊,來建制出一套學位論文離群程度的初階篩選,我們以Python Selenium及BeautifulSoup針對大英線上圖書館(British Library EThOS)與倫敦大學體系下的倫敦政經學院(LSE)線上圖書館的論文資料為例,論證在這兩邊線上圖書館論文上架編碼的排序方式是否具有一定程度的一致性,共同作為學位論文離群程度檢核的一種參考。

    考古是為了還原過去的歷史真相,利用網路公開資訊還原真相的過程,我們稱為數位考古。本論文定義一個同序矩陣,建立評量函數,透過排序的差異度來評斷論文上架時間的離群程度。藉此指標,若驗證學位時發現有嚴重離群此指標平均的論文,我們才需特別用正式管道申請的方式來驗證。
    Given the high time cost of applying for foreign degree thesis certification, the aim of this paper is to use publicly available online information to establish a preliminary screening system for the degree of outlier in theses. We use Python Selenium and BeautifulSoup to examine thesis data from the British Library EThOS and the online library of the London School of Economics (LSE) under the University of London system. We argue whether the sorting methods of thesis coding on these two online libraries have a certain degree of consistency, both serving as a reference for checking the degree of outlier in theses.

    Archaeology is for the purpose of restoring the historical truth of the past, and the process of using publicly available online information to restore the truth, we call it digital archaeology. This paper defines a permutation matrix and establishes an evaluation function. The degree of deviation in sorting is used to judge the outlier degree of thesis shelf time. With this index, if a severe outlier is found during degree verification, we only need to verify it by applying through formal channels.
    Reference: [1] 蔡壁如論文遭指「不當引用」 德明科大證實:啟動審理 (https://news.tvbs.com.tw/politics/1877096)

    [2] 蔡壁如為論文驟然告別立院 4個考量設下停損點 (https://vip.udn.com/vip/story/122367/6688730)

    [3] 林智堅「論文門」懶人包不斷更新:兩派到底吵什麼?後續有何發展?論文爭議始末一次看 (https://ynews.page.link/9Pb8)

    [4] 台大認定林智堅論文抄襲撤銷碩士學位 教育部暫未收到訴願申請 (https://ynews.page.link/gpXM)

    [5] 快訊》林智堅將主動退選!鄭運鵬接棒選桃園市長 (https://ctsnews.page.link/3jHaq)

    [6] 週刊爆博士論文涉抄襲,高虹安公布辛辛那提大學校方聲明強調無版權問題,「我不是林智堅」(https://www.thenewslens.com/article/173039)

    [7] 快訊/博士論文突遭母校下架?高虹安回應了 (https://ynews.page.link/CnkFp)

    [8] D. M. Thomas and S. Mathur, "Data Analysis by Web Scraping using Python," 2019 3rd International conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2019, pp. 450-454, doi: 10.1109/ICECA.2019.8822022.

    [9] Boeing, G.; Waddell, P. (2017). New Insights into Rental Housing Markets across the United States: Web Scraping and Analyzing Craigslist Rental Listings. Journal of Planning Education and Research, 37(4), 457–476.

    [10] IDRIS, Aizal Yusrina; BAMOALLEM, Razan; MOHAMAD HATTA, Mohamad Harith Azfar. Web Scraping and Regression Analysis based on Machine Learning for COVID-19 with Rapid Software Platform. Mathematical Sciences and Informatics Journal, [S.l.], v. 3, n. 1, p. 75-85, may 2022. ISSN 2735-0703.

    [11] 錯排問題 (https://peienwu.com/derangement/)

    [12] Hassani, Mehdi. &quot;Derangements and applications..&quot; Journal of Integer Sequences [ electronic only ] 6.1 (2003): Art. 03.1.2, 8 p., electronic only-Art. 03.1.2, 8 p., electronic only. <http://eudml.org/doc/51444>.

    [13] Sloane, N.J.A. (編). Sequence A000166 (Subfactorial or rencontres numbers, or derangements: number of permutations of n elements with no fixed points.). The On-Line Encyclopedia of Integer Sequences. OEIS Foundation

    [14] Ismail, M.E.H., Simeonov, P. Asymptotics of generalized derangements. Adv Comput Math 39, 101–127 (2013). https://doi.org/10.1007/s10444-011-9271-7
    Description: 碩士
    國立政治大學
    應用數學系
    106751016
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0106751016
    Data Type: thesis
    Appears in Collections:[應用數學系] 學位論文

    Files in This Item:

    File Description SizeFormat
    101601.pdf1468KbAdobe PDF280View/Open


    All items in 政大典藏 are protected by copyright, with all rights reserved.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback