WEKO3
アイテム
{"_buckets": {"deposit": "5a2cf932-7711-4a56-b278-ca4fdfb6da5e"}, "_deposit": {"created_by": 3, "id": "2057", "owners": [3], "pid": {"revision_id": 0, "type": "depid", "value": "2057"}, "status": "published"}, "_oai": {"id": "oai:aue.repo.nii.ac.jp:00002057", "sets": ["238"]}, "author_link": ["69", "118", "1800"], "item_3_biblio_info_7": {"attribute_name": "書誌事項", "attribute_value_mlt": [{"bibliographicIssueDates": {"bibliographicIssueDate": "2009-02", "bibliographicIssueDateType": "Issued"}, "bibliographicPageEnd": "123", "bibliographicPageStart": "117", "bibliographicVolumeNumber": "12", "bibliographic_titles": [{"bibliographic_title": "愛知教育大学教育実践総合センター紀要"}]}]}, "item_3_description_27": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"subitem_description": "text", "subitem_description_type": "Other"}]}, "item_3_description_5": {"attribute_name": "抄録", "attribute_value_mlt": [{"subitem_description": "The objective of this paper is to discuss various problems that arise when Japanese-language text that is circulating on the World-Wide Web (WWW)is utilized as a corpus. First of all, our review of previous research relating to Japanese-language corpora showed that research into the application of the WWW as a Japanese-language corpus has still not been tackled sufficiently. We then studied all of the research papers that were presented at national conventions over a two-year period for one Japanese academic society relating to information and education. As a result, it became clear that although there have been several research projects into the use of text-mining methods, there has been almost no research relating to WWW Japanese-language corpora. In the light of these findings, we considered the various problems that might arise during research into WWW Japanese-language corpora. In other words, some of the points that we need to consider include: 1) sample bias, 2)the self-images projected by authors, 3)proof of the validity of such contents, 4)the large numbers of submissions by the same people, 5)the fact that data management including the revision and update of such contents is often done at the individual level, and 6)plagiarism of written works and quoting from other sites. Thus, although sample bias does remain, we can say that the Internet gives us the first opportunity in history to accumulate vast quantities of personally-published data. We have become able to utilize the Internet both quantitatively and qualitatively as a modern intellectual resource. Once we have clarified suitable methods for using this intellectual resource as a target of research in analysis, we should be able to engage in research towards structuring a WWW Japanese-language corpus in the future.", "subitem_description_type": "Abstract"}]}, "item_3_link_3": {"attribute_name": "研究者総覧へのリンク", "attribute_value_mlt": [{"subitem_link_text": "Nozaki, Hironari", "subitem_link_url": "https://souran.aichi-edu.ac.jp/teachers/83ea2bcba0bf3673.html"}, {"subitem_link_text": "Umeda, Kyoko", "subitem_link_url": "https://souran.aichi-edu.ac.jp/teachers/c345ac14a396c280.html"}]}, "item_3_publisher_8": {"attribute_name": "出版者", "attribute_value_mlt": [{"subitem_publisher": "愛知教育大学教育実践総合センター"}]}, "item_3_source_id_11": {"attribute_name": "書誌レコードID", "attribute_value_mlt": [{"subitem_source_identifier": "AA11232717", "subitem_source_identifier_type": "NCID"}]}, "item_3_source_id_9": {"attribute_name": "ISSN", "attribute_value_mlt": [{"subitem_source_identifier": "1344-2597", "subitem_source_identifier_type": "ISSN"}]}, "item_3_text_10": {"attribute_name": "書誌情報", "attribute_value_mlt": [{"subitem_text_value": "愛知教育大学教育実践総合センター紀要. 2009, 12 p. 117-123."}]}, "item_3_text_26": {"attribute_name": "著者別名", "attribute_value_mlt": [{"subitem_text_value": "ノザキ, ヒロナリ"}, {"subitem_text_value": "トダ, カズユキ"}, {"subitem_text_value": "ウメダ, キョウコ"}]}, "item_3_text_29": {"attribute_name": "旧登録日", "attribute_value_mlt": [{"subitem_text_value": "2009-06-08T01:19:47Z"}]}, "item_3_text_30": {"attribute_name": "旧公開日", "attribute_value_mlt": [{"subitem_text_value": "2009-06-08T01:19:47Z"}]}, "item_3_text_31": {"attribute_name": "旧ソートキー", "attribute_value_mlt": [{"subitem_text_value": "14"}]}, "item_3_text_4": {"attribute_name": "著者(別言語)", "attribute_value_mlt": [{"subitem_text_value": "野崎, 浩成"}, {"subitem_text_value": "戸田, 和幸"}, {"subitem_text_value": "梅田, 恭子"}]}, "item_3_version_type_14": {"attribute_name": "著者版フラグ", "attribute_value_mlt": [{"subitem_version_resource": "http://purl.org/coar/version/c_970fb48d4fbd8a85", "subitem_version_type": "VoR"}]}, "item_creator": {"attribute_name": "著者", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "Nozaki, Hironari"}], "nameIdentifiers": [{"nameIdentifier": "118", "nameIdentifierScheme": "WEKO"}, {"nameIdentifier": "80275148", "nameIdentifierScheme": "e-Rad", "nameIdentifierURI": "https://kaken.nii.ac.jp/ja/search/?qm=80275148"}, {"nameIdentifier": "80275148", "nameIdentifierScheme": "KAKEN - 研究者検索", "nameIdentifierURI": "https://nrid.nii.ac.jp/ja/nrid/1000080275148/"}]}, {"creatorNames": [{"creatorName": "Toda, Kazuyuki"}], "nameIdentifiers": [{"nameIdentifier": "1800", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Umeda, Kyoko"}], "nameIdentifiers": [{"nameIdentifier": "69", "nameIdentifierScheme": "WEKO"}, {"nameIdentifier": "70345940", "nameIdentifierScheme": "e-Rad", "nameIdentifierURI": "https://kaken.nii.ac.jp/ja/search/?qm=70345940"}, {"nameIdentifier": "70345940", "nameIdentifierScheme": "KAKEN - 研究者検索", "nameIdentifierURI": "https://nrid.nii.ac.jp/ja/nrid/1000070345940/"}]}]}, "item_files": {"attribute_name": "ファイル情報", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_date", "date": [{"dateType": "Available", "dateValue": "2017-03-27"}], "displaytype": "detail", "download_preview_message": "", "file_order": 0, "filename": "jissenkiyo12117123.pdf", "filesize": [{"value": "183.3 kB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 183300.0, "url": {"label": "jissenkiyo12117123.pdf", "url": "https://aue.repo.nii.ac.jp/record/2057/files/jissenkiyo12117123.pdf"}, "version_id": "fe93b063-38f2-4eb1-8e90-86edd29939de"}]}, "item_keyword": {"attribute_name": "キーワード", "attribute_value_mlt": [{"subitem_subject": "Japanese-Language Corpus", "subitem_subject_scheme": "Other"}, {"subitem_subject": "Loan Words", "subitem_subject_scheme": "Other"}, {"subitem_subject": "Text-mining Methods", "subitem_subject_scheme": "Other"}]}, "item_language": {"attribute_name": "言語", "attribute_value_mlt": [{"subitem_language": "eng"}]}, "item_resource_type": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"resourcetype": "departmental bulletin paper", "resourceuri": "http://purl.org/coar/resource_type/c_6501"}]}, "item_title": "Various Problems Concerning the Construction of a WWW Japanese-Language Corpus― The Current State and Future Prospects of Japanese-Language Corpus Research ―", "item_titles": {"attribute_name": "タイトル", "attribute_value_mlt": [{"subitem_title": "Various Problems Concerning the Construction of a WWW Japanese-Language Corpus― The Current State and Future Prospects of Japanese-Language Corpus Research ―", "subitem_title_language": "en"}]}, "item_type_id": "3", "owner": "3", "path": ["238"], "permalink_uri": "http://hdl.handle.net/10424/1920", "pubdate": {"attribute_name": "公開日", "attribute_value": "2009-06-08"}, "publish_date": "2009-06-08", "publish_status": "0", "recid": "2057", "relation": {}, "relation_version_is_last": true, "title": ["Various Problems Concerning the Construction of a WWW Japanese-Language Corpus― The Current State and Future Prospects of Japanese-Language Corpus Research ―"], "weko_shared_id": 3}
Various Problems Concerning the Construction of a WWW Japanese-Language Corpus― The Current State and Future Prospects of Japanese-Language Corpus Research ―
http://hdl.handle.net/10424/1920
http://hdl.handle.net/10424/19208f7d0433-10a6-4ae4-84ba-9d2b9cb6ef80
名前 / ファイル | ライセンス | アクション |
---|---|---|
jissenkiyo12117123.pdf (183.3 kB)
|
|
Item type | 紀要論文 / Departmental Bulletin Paper(1) | |||||
---|---|---|---|---|---|---|
公開日 | 2009-06-08 | |||||
タイトル | ||||||
言語 | en | |||||
タイトル | Various Problems Concerning the Construction of a WWW Japanese-Language Corpus― The Current State and Future Prospects of Japanese-Language Corpus Research ― | |||||
言語 | ||||||
言語 | eng | |||||
キーワード | ||||||
主題Scheme | Other | |||||
主題 | Japanese-Language Corpus | |||||
キーワード | ||||||
主題Scheme | Other | |||||
主題 | Loan Words | |||||
キーワード | ||||||
主題Scheme | Other | |||||
主題 | Text-mining Methods | |||||
資源タイプ | ||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_6501 | |||||
資源タイプ | departmental bulletin paper | |||||
著者 |
Nozaki, Hironari
× Nozaki, Hironari× Toda, Kazuyuki× Umeda, Kyoko |
|||||
研究者総覧へのリンク | ||||||
Nozaki, Hironari | ||||||
https://souran.aichi-edu.ac.jp/teachers/83ea2bcba0bf3673.html | ||||||
研究者総覧へのリンク | ||||||
Umeda, Kyoko | ||||||
https://souran.aichi-edu.ac.jp/teachers/c345ac14a396c280.html | ||||||
著者(別言語) | ||||||
野崎, 浩成 | ||||||
著者(別言語) | ||||||
戸田, 和幸 | ||||||
著者(別言語) | ||||||
梅田, 恭子 | ||||||
抄録 | ||||||
内容記述タイプ | Abstract | |||||
内容記述 | The objective of this paper is to discuss various problems that arise when Japanese-language text that is circulating on the World-Wide Web (WWW)is utilized as a corpus. First of all, our review of previous research relating to Japanese-language corpora showed that research into the application of the WWW as a Japanese-language corpus has still not been tackled sufficiently. We then studied all of the research papers that were presented at national conventions over a two-year period for one Japanese academic society relating to information and education. As a result, it became clear that although there have been several research projects into the use of text-mining methods, there has been almost no research relating to WWW Japanese-language corpora. In the light of these findings, we considered the various problems that might arise during research into WWW Japanese-language corpora. In other words, some of the points that we need to consider include: 1) sample bias, 2)the self-images projected by authors, 3)proof of the validity of such contents, 4)the large numbers of submissions by the same people, 5)the fact that data management including the revision and update of such contents is often done at the individual level, and 6)plagiarism of written works and quoting from other sites. Thus, although sample bias does remain, we can say that the Internet gives us the first opportunity in history to accumulate vast quantities of personally-published data. We have become able to utilize the Internet both quantitatively and qualitatively as a modern intellectual resource. Once we have clarified suitable methods for using this intellectual resource as a target of research in analysis, we should be able to engage in research towards structuring a WWW Japanese-language corpus in the future. | |||||
書誌事項 |
愛知教育大学教育実践総合センター紀要 巻 12, p. 117-123, 発行日 2009-02 |
|||||
出版者 | ||||||
出版者 | 愛知教育大学教育実践総合センター | |||||
ISSN | ||||||
収録物識別子タイプ | ISSN | |||||
収録物識別子 | 1344-2597 | |||||
書誌情報 | ||||||
愛知教育大学教育実践総合センター紀要. 2009, 12 p. 117-123. | ||||||
書誌レコードID | ||||||
収録物識別子タイプ | NCID | |||||
収録物識別子 | AA11232717 | |||||
著者版フラグ | ||||||
出版タイプ | VoR | |||||
出版タイプResource | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |||||
著者別名 | ||||||
ノザキ, ヒロナリ | ||||||
著者別名 | ||||||
トダ, カズユキ | ||||||
著者別名 | ||||||
ウメダ, キョウコ | ||||||
資源タイプ | ||||||
内容記述タイプ | Other | |||||
内容記述 | text |