Note that files are enumerated , and each file is associated with an integer " fileid . 請(qǐng)注意文檔是編號(hào)的,每個(gè)文檔都關(guān)聯(lián)一個(gè)整數(shù)" fileid " 。
Python dictionary lookups are very efficient , and relatively little additional work goes into connecting " fileid " s with the actual filenames Python詞典的查找效率很高,而且連結(jié)" fileid "和實(shí)際文件名的附加工作相對(duì)很少。
Rid : an identifier in the format fileid : pagenumber : rid , where fileid identifies the file containing the page , pagenumber identifies the page containing the row , and rid identifies the specific row on the page Rid :格式為fileid : pagenumber : rid的標(biāo)識(shí)符,其中fileid標(biāo)識(shí)包含頁(yè)的文件, pagenumber標(biāo)識(shí)包含行的頁(yè), rid標(biāo)識(shí)頁(yè)上的特定行。
Maintains a python dictionary whose keys are words and whose values are themselves dictionaries whose keys are " fileid " s and whose values are simply a count of how many times the word occurs in the file indicated by the " fileid " 有一個(gè)python詞典,其關(guān)鍵字為詞語(yǔ),其值本身又是詞典,這個(gè)詞典的關(guān)鍵字為" fileid " ,其值為" fileid "指定的詞語(yǔ)在文件中的出現(xiàn)次數(shù)。