Predicted tertiary structure showed that thl2 protein possessed a domain , and the srk - binding site cppc is located in tertiary structure surface , which provided the spacial foundation for thl2 binding srk kinase 預(yù)測的thl2蛋白質(zhì)高級結(jié)構(gòu)中有一個結(jié)構(gòu),并且其與srk結(jié)合的活性位點(diǎn)cppc位于蛋白質(zhì)的表面,這為thl2蛋白與srk激酶結(jié)合提供了空間上的基礎(chǔ)。
In the preprocessing stage the method of user and session identification often adopt heuristic algorithm for the being of cache and agent . this induce the uncertainty of data resource . the cppc algorithm avoid the limitation and has no use for complicated hash data structure . in this algorithm , by constructing a userld - url revelant matrix similar customer groups are discovered by measuring similarity between column vectors and relevant web pages are obtained by measuring similarity between row vectors ; frequent access paths can also be discovered by further processing of the latter . experiments show the effectiveness of the algorithm . in the fourth part , this thesis bring some key techniques of data mining into web usage mining , combine the characteristic of relation database design and implement a web usage mining system wlgms with function of visible . lt can provide the user with decision support , and has good practicability 本文算法避免了這個缺陷,且不需要復(fù)雜的hash數(shù)據(jù)結(jié)構(gòu),通過構(gòu)造一個userid - uel關(guān)聯(lián)矩陣,對列向量進(jìn)行相似性分析得到相似客戶群體,對行向量進(jìn)行相似性度量獲得相關(guān)web頁面,對后者再進(jìn)一步處理得到頻繁訪問路徑。實(shí)驗(yàn)結(jié)果表明了算法的有效性。第四是本文將傳統(tǒng)數(shù)據(jù)挖掘過程中的各種關(guān)鍵技術(shù),引入到對web使用信息的挖掘活動中,結(jié)合關(guān)系數(shù)據(jù)庫的特點(diǎn)設(shè)計(jì)并實(shí)現(xiàn)了一個具有可廣西人學(xué)頎士學(xué)位論義視化功能的web使用挖掘系統(tǒng)wlgms 。
It also is called great pioneering work of copy green . after invention , it has attracted much attentions from government officers , scientists , and press circles . they included : mao rubai , president of ningxia hui autonomous district ; ren qixing , head of ningxia cppc ; zhou shengxian , president of nation forestry bureau and xu yuexian , subdecanal of china agriculture science academy ; liu zhong , vice chairmen of the autonomous region and officers of nstc ningxia science and technique committee and of the yinchuan government 過去由于糧食不夠,把山上的樹都砍掉,種上糧食現(xiàn)在我們糧食已經(jīng)富馀,完全可以無償向農(nóng)民提供糧食,讓他們把山上種的糧田退出來,種上樹,或者種草,也就是退耕還林退耕還草退耕還湖,使西部地區(qū)有一個非常好的美麗的生態(tài)環(huán)境,有一個能吸引外國投資的好環(huán)境。
This thesis includes four parts in which the technologies of web usage mininig are systematically researched . in the first part we summarize the techniques of data mining and web usage mining , present the significance of the research on web usage mininig , the status of research and the problem which web usage mininig will face with . in the second part we discuss the web usage mininig according to the process of web mining . in the stage of data preparing and preprocessing we discuss the algorithm of data cleaning , user and session identification in detail , and present a data model of association rules and sequential patterns in the stage of pattern discovery , discuss the useful method of pattern analysis in last stage . a synthesis clustering algorithm cppc is proposed in the third part of this thesis 本文分主要從以下四個方面對web使用挖掘進(jìn)行了系統(tǒng)的分析和研究。第一是對數(shù)據(jù)挖掘和web挖掘進(jìn)行了概述,闡述了web挖掘的意義、研究的現(xiàn)狀、面臨的問題。第二是討論了web使用挖掘的三個階段:在數(shù)據(jù)準(zhǔn)備和預(yù)處理階段重點(diǎn)討論了數(shù)據(jù)清洗及用戶和會話識別算法;在模式發(fā)現(xiàn)階段定義了關(guān)聯(lián)規(guī)則和序列模式的數(shù)據(jù)模型;模式分析階段則討論了現(xiàn)行的幾種分析方法。
In thl1 gene , there are some famililar enzyme digest sites including bamh i , hind , sac i and sal i . the sequence of thl1l from brassica oleracea l . shows 99 % identity to the sequence of thll from brassica napus l . there are some phosphorylation sites and an active site of thoiredoxin h members consisting of cppc in thll protein 甘藍(lán)和油菜thl1基因序列有5個堿基的差異,而氨基酸僅有2個氨基酸的變異,同源性達(dá)99 。在thl1蛋白的氨基酸序列中發(fā)現(xiàn)了一個硫氧還蛋白家族的活性位點(diǎn)( cppc )和多個磷酸化位點(diǎn)。