Document similarity search retrieves, from a document collection, the documents that are similar to a given query document and returns them to the user as a list ranked by similarity. It is widely used in text and web systems such as digital libraries and search engines, for example the similar-document recommendation feature of the CiteSeer.IST scientific literature digital library and Google's similar-pages query. Traditional retrieval models, including Okapi's BM25 model and SMART's vector space model with length normalization, can handle this problem to some extent by treating the query document as a long query.
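The long-query idea can be illustrated with a minimal sketch. It assumes a simplified vector space model: TF-IDF weights with cosine similarity, where the cosine's division by vector norms stands in for length normalization. It is not Okapi BM25 and not SMART's exact weighting scheme, and the function names (`tokenize`, `tfidf_vectors`, `similar_documents`) and the whitespace tokenizer are hypothetical choices made only for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization is enough for the sketch.
    return text.lower().split()


def tfidf_vectors(token_lists):
    """Build sparse TF-IDF vectors (dicts) and the IDF table for a corpus."""
    n = len(token_lists)
    df = Counter()
    for tokens in token_lists:
        df.update(set(tokens))  # count each term once per document
    # Smoothed IDF so that every weight stays positive.
    idf = {t: math.log((n + 1) / (c + 1)) + 1.0 for t, c in df.items()}
    vectors = []
    for tokens in token_lists:
        tf = Counter(tokens)
        vectors.append({t: f * idf[t] for t, f in tf.items()})
    return vectors, idf


def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similar_documents(query_text, corpus):
    """Treat the whole query document as one long query and rank the corpus.

    Returns (document index, score) pairs, most similar first.
    """
    vectors, idf = tfidf_vectors([tokenize(d) for d in corpus])
    query_tf = Counter(tokenize(query_text))
    # Terms that never occur in the corpus cannot match, so they are dropped.
    query_vec = {t: f * idf[t] for t, f in query_tf.items() if t in idf}
    scored = [(i, cosine(query_vec, v)) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored


corpus = [
    "digital library search engine",
    "cooking pasta recipes",
    "search engine ranking documents",
]
ranked = similar_documents("digital library search", corpus)
```

Here `ranked` lists the corpus indices in order of decreasing similarity to the query document. A production system would typically use an inverted index rather than scoring every document, and might swap in BM25 scoring.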
This theory has also played an important role in practice: organizations that provide services based on scientific literature, including editorial, publishing, printing, and distribution units as well as libraries and information centers, have long held an irreplaceable position in the scientific information communication system. According to this theory, scientific information communication is divided into two processes, the formal and the informal.
科学: science; scientific knowledg ...
文献: document; literature
科学文献集: archives des sciences
内科学文献: archives of internal medicine
外科学文献: archives of surgery
眼科学文献: archives of ophthalmology
地球科学文献: geoscience documentation
定量科学文献: quantitative scientific literature
耳喉科学文献: archives of otolaryngology
科学文献索引: science citation index
人文科学文献索引: humanities index
音乐科学文献集: archiv fur musikwissenschaft
东方科学哲学文献选读: readings in oriental sci. & philosophic wri
社会科学文献出版社: publishing house of china's social sciences
社会科学文献协调委员会: ation in the social sciences
中国科学文献数据库: chinese science document database
病理学文献: archives of pathology
东方学文献: orientalistische literaturzeitung
放射学文献: radiologic literature
化学文献: chemical document; chemical literature searching; scientific literature in chemistry
经济学文献: economic literature
数学文献: archiv der mathematik; archives mathematiques; archives of mathematics
文学文献学: science of literary literature
医学文献: medical literature
医学文献库: medical library