Processing XML Files in R

2017-06-18

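XML is a file format for sharing data on the World Wide Web, on intranets, and elsewhere, using standard ASCII text. The name stands for Extensible Markup Language (XML). Like HTML, it contains markup tags. But unlike HTML, where the tags describe the structure of the page, in XML the tags describe the meaning of the data contained in the file.

You can read an XML file in R using the "XML" package. Install the package with the following command.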

install.packages("XML")

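Input Data

Create an XML file by copying the data below into a text editor such as Notepad. Save the file with a .xml extension, choosing All Files (*.*) as the file type. The examples that follow read it as input.xml from the current working directory.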

<RECORDS>
<EMPLOYEE>
    <ID>1</ID>
    <NAME>Rick</NAME>
    <SALARY>623.3</SALARY>
    <STARTDATE>1/1/2012</STARTDATE>
    <DEPT>IT</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>2</ID>
    <NAME>Dan</NAME>
    <SALARY>515.2</SALARY>
    <STARTDATE>9/23/2013</STARTDATE>
    <DEPT>Operations</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>3</ID>
    <NAME>Michelle</NAME>
    <SALARY>611</SALARY>
    <STARTDATE>11/15/2014</STARTDATE>
    <DEPT>IT</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>4</ID>
    <NAME>Ryan</NAME>
    <SALARY>729</SALARY>
    <STARTDATE>5/11/2014</STARTDATE>
    <DEPT>HR</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>5</ID>
    <NAME>Gary</NAME>
    <SALARY>843.25</SALARY>
    <STARTDATE>3/27/2015</STARTDATE>
    <DEPT>Finance</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>6</ID>
    <NAME>Nina</NAME>
    <SALARY>578</SALARY>
    <STARTDATE>5/21/2013</STARTDATE>
    <DEPT>IT</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>7</ID>
    <NAME>Simon</NAME>
    <SALARY>632.8</SALARY>
    <STARTDATE>7/30/2013</STARTDATE>
    <DEPT>Operations</DEPT>
</EMPLOYEE>
<EMPLOYEE>
    <ID>8</ID>
    <NAME>Guru</NAME>
    <SALARY>722.5</SALARY>
    <STARTDATE>6/17/2014</STARTDATE>
    <DEPT>Finance</DEPT>
</EMPLOYEE>
</RECORDS>
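
If you would rather create the file from R than from a text editor, something like the following sketch should work. It is not part of the original tutorial: the names `xml_lines` and `input_path` are illustrative, the file is written to a temporary directory as an assumption so nothing is overwritten, and only the first two records are included for brevity.

```r
# Illustrative sketch: build the sample file from R instead of a text editor.
# Only the first two EMPLOYEE records are written here.
xml_lines <- c(
  "<RECORDS>",
  "<EMPLOYEE>",
  "    <ID>1</ID>",
  "    <NAME>Rick</NAME>",
  "    <SALARY>623.3</SALARY>",
  "    <STARTDATE>1/1/2012</STARTDATE>",
  "    <DEPT>IT</DEPT>",
  "</EMPLOYEE>",
  "<EMPLOYEE>",
  "    <ID>2</ID>",
  "    <NAME>Dan</NAME>",
  "    <SALARY>515.2</SALARY>",
  "    <STARTDATE>9/23/2013</STARTDATE>",
  "    <DEPT>Operations</DEPT>",
  "</EMPLOYEE>",
  "</RECORDS>"
)

# Assumption: a temporary location. Use "input.xml" instead to match the tutorial.
input_path <- file.path(tempdir(), "input.xml")
writeLines(xml_lines, input_path)

# Confirm that the file was created and count the lines written.
print(file.exists(input_path))
print(length(readLines(input_path)))
```

The remaining records can be appended to `xml_lines` in the same way before the closing `</RECORDS>` line.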

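Reading the XML File

R reads the XML file with the function xmlParse() (the name is case-sensitive). The parsed document is held in R as an object of class XMLInternalDocument, whose nodes can be indexed much like the elements of a list: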

# Load the package required to read XML files.
library("XML")

# Also load the other required package.
library("methods")

# Give the input file name to the function.
result <- xmlParse(file="input.xml")

# Print the result.
print(result)

When we execute the above code, it produces the following result:

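<?xml version="1.0"?>
<RECORDS>
  <EMPLOYEE>
    <ID>1</ID>
    <NAME>Rick</NAME>
    <SALARY>623.3</SALARY>
    <STARTDATE>1/1/2012</STARTDATE>
    <DEPT>IT</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>2</ID>
    <NAME>Dan</NAME>
    <SALARY>515.2</SALARY>
    <STARTDATE>9/23/2013</STARTDATE>
    <DEPT>Operations</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>3</ID>
    <NAME>Michelle</NAME>
    <SALARY>611</SALARY>
    <STARTDATE>11/15/2014</STARTDATE>
    <DEPT>IT</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>4</ID>
    <NAME>Ryan</NAME>
    <SALARY>729</SALARY>
    <STARTDATE>5/11/2014</STARTDATE>
    <DEPT>HR</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>5</ID>
    <NAME>Gary</NAME>
    <SALARY>843.25</SALARY>
    <STARTDATE>3/27/2015</STARTDATE>
    <DEPT>Finance</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>6</ID>
    <NAME>Nina</NAME>
    <SALARY>578</SALARY>
    <STARTDATE>5/21/2013</STARTDATE>
    <DEPT>IT</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>7</ID>
    <NAME>Simon</NAME>
    <SALARY>632.8</SALARY>
    <STARTDATE>7/30/2013</STARTDATE>
    <DEPT>Operations</DEPT>
  </EMPLOYEE>
  <EMPLOYEE>
    <ID>8</ID>
    <NAME>Guru</NAME>
    <SALARY>722.5</SALARY>
    <STARTDATE>6/17/2014</STARTDATE>
    <DEPT>Finance</DEPT>
  </EMPLOYEE>
</RECORDS>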
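
To see what kind of object the parser hands back, a quick check along the following lines may help. This is a sketch rather than part of the original tutorial: it parses a shortened copy of the sample data from a string (using `asText = TRUE` so that no file is needed), and the names `xml_text`, `doc`, and `root_name` are illustrative.

```r
library("XML")

# A shortened copy of the sample data, held in a string for this sketch.
xml_text <- "<RECORDS>
<EMPLOYEE><ID>1</ID><NAME>Rick</NAME><SALARY>623.3</SALARY><STARTDATE>1/1/2012</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
<EMPLOYEE><ID>2</ID><NAME>Dan</NAME><SALARY>515.2</SALARY><STARTDATE>9/23/2013</STARTDATE><DEPT>Operations</DEPT></EMPLOYEE>
</RECORDS>"

# asText = TRUE tells xmlParse() that the argument is XML content, not a file name.
doc <- xmlParse(xml_text, asText = TRUE)

# Inspect the class of the parsed document.
print(class(doc))

# The tag name of the root element.
root_name <- xmlName(xmlRoot(doc))
print(root_name)
```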

Getting the Number of Nodes in the XML File

# Load the packages required to read XML files.
library("XML")
library("methods")

# Give the input file name to the function.
result <- xmlParse(file="input.xml")

# Extract the root node from the XML file.
rootnode <- xmlRoot(result)

# Find number of nodes in the root.
rootsize <- xmlSize(rootnode)

# Print the result.
print(rootsize)

When we execute the above code, it produces the following result:

[1] 8
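
The same counting idea can be pushed one level down, or expressed as an XPath query. The sketch below assumes a three-record copy of the sample data parsed from a string; the variable names are illustrative and not taken from the original tutorial.

```r
library("XML")

# A three-record copy of the sample data, held in a string for this sketch.
xml_text <- "<RECORDS>
<EMPLOYEE><ID>1</ID><NAME>Rick</NAME><SALARY>623.3</SALARY><STARTDATE>1/1/2012</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
<EMPLOYEE><ID>2</ID><NAME>Dan</NAME><SALARY>515.2</SALARY><STARTDATE>9/23/2013</STARTDATE><DEPT>Operations</DEPT></EMPLOYEE>
<EMPLOYEE><ID>3</ID><NAME>Michelle</NAME><SALARY>611</SALARY><STARTDATE>11/15/2014</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
</RECORDS>"
doc <- xmlParse(xml_text, asText = TRUE)
rootnode <- xmlRoot(doc)

# Number of EMPLOYEE records directly under the root.
record_count <- xmlSize(rootnode)
print(record_count)

# Number of child elements inside each record.
fields_per_record <- xmlSApply(rootnode, xmlSize)
print(fields_per_record)

# The same record count obtained with an XPath query.
xpath_count <- length(getNodeSet(doc, "//EMPLOYEE"))
print(xpath_count)
```

The XPath form is handy when the records are not direct children of the root, since `//EMPLOYEE` matches at any depth.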

Details of the First Node

Let's look at the first record of the parsed file. It gives us an idea of the various elements present in the top-level node.

# Load the packages required to read XML files.
library("XML")
library("methods")

# Give the input file name to the function.
result <- xmlParse(file="input.xml")

# Extract the root node from the XML file.
rootnode <- xmlRoot(result)

# Print the result.
print(rootnode[1])

When we execute the above code, it produces the following result:

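$EMPLOYEE
<EMPLOYEE>
  <ID>1</ID>
  <NAME>Rick</NAME>
  <SALARY>623.3</SALARY>
  <STARTDATE>1/1/2012</STARTDATE>
  <DEPT>IT</DEPT>
</EMPLOYEE>

attr(,"class")
[1] "XMLInternalNodeList" "XMLNodeList"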

Getting Different Elements of a Node

# Load the packages required to read XML files.
library("XML")
library("methods")

# Give the input file name to the function.
result <- xmlParse(file="input.xml")

# Extract the root node from the XML file.
rootnode <- xmlRoot(result)

# Get the first element of the first node.
print(rootnode[[1]][[1]])

# Get the fifth element of the first node.
print(rootnode[[1]][[5]])

# Get the second element of the third node.
print(rootnode[[3]][[2]])

When we execute the above code, it produces the following result:

<ID>1</ID>
<DEPT>IT</DEPT>
<NAME>Michelle</NAME>
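
Indexing a node returns the node itself, tags included. When only the text inside an element is wanted, `xmlValue()` from the same package can be applied to the node. The sketch below assumes a three-record copy of the sample data parsed from a string; the variable names are illustrative.

```r
library("XML")

# A three-record copy of the sample data, held in a string for this sketch.
xml_text <- "<RECORDS>
<EMPLOYEE><ID>1</ID><NAME>Rick</NAME><SALARY>623.3</SALARY><STARTDATE>1/1/2012</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
<EMPLOYEE><ID>2</ID><NAME>Dan</NAME><SALARY>515.2</SALARY><STARTDATE>9/23/2013</STARTDATE><DEPT>Operations</DEPT></EMPLOYEE>
<EMPLOYEE><ID>3</ID><NAME>Michelle</NAME><SALARY>611</SALARY><STARTDATE>11/15/2014</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
</RECORDS>"
doc <- xmlParse(xml_text, asText = TRUE)
rootnode <- xmlRoot(doc)

# Text content of the first element of the first record (the ID).
first_id <- xmlValue(rootnode[[1]][[1]])
print(first_id)

# Text content of the second element of the third record (the NAME).
third_name <- xmlValue(rootnode[[3]][[2]])
print(third_name)

# All NAME values at once, selected with an XPath expression.
all_names <- xpathSApply(doc, "//EMPLOYEE/NAME", xmlValue)
print(all_names)
```

Note that `xmlValue()` returns character strings, so a value such as the ID still needs `as.integer()` or `as.numeric()` before it can be used in arithmetic.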

XML to Data Frame

To handle the data in large files effectively, we read the data from the XML file into a data frame and then process that data frame for data analysis.

# Load the packages required to read XML files.
library("XML")
library("methods")

# Convert the input xml file to a data frame.
xmldataframe <- xmlToDataFrame("input.xml")
print(xmldataframe)

When we execute the above code, it produces the following result:

  ID     NAME SALARY  STARTDATE       DEPT
1  1     Rick  623.3   1/1/2012         IT
2  2      Dan  515.2  9/23/2013 Operations
3  3 Michelle    611 11/15/2014         IT
4  4     Ryan    729  5/11/2014         HR
5  5     Gary 843.25  3/27/2015    Finance
6  6     Nina    578  5/21/2013         IT
7  7    Simon  632.8  7/30/2013 Operations
8  8     Guru  722.5  6/17/2014    Finance

Because the data is now available as a data frame, we can use the usual data frame functions to read and manipulate it.
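
As one possible follow-up, the sketch below converts the text columns to usable types and summarises one department. It assumes a three-record copy of the sample data parsed from a string, and the names `employees`, `it_staff`, and `it_mean_salary` are illustrative rather than taken from the original tutorial.

```r
library("XML")

# A three-record copy of the sample data, held in a string for this sketch.
xml_text <- "<RECORDS>
<EMPLOYEE><ID>1</ID><NAME>Rick</NAME><SALARY>623.3</SALARY><STARTDATE>1/1/2012</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
<EMPLOYEE><ID>2</ID><NAME>Dan</NAME><SALARY>515.2</SALARY><STARTDATE>9/23/2013</STARTDATE><DEPT>Operations</DEPT></EMPLOYEE>
<EMPLOYEE><ID>3</ID><NAME>Michelle</NAME><SALARY>611</SALARY><STARTDATE>11/15/2014</STARTDATE><DEPT>IT</DEPT></EMPLOYEE>
</RECORDS>"
doc <- xmlParse(xml_text, asText = TRUE)

# xmlToDataFrame() also accepts an already parsed document.
employees <- xmlToDataFrame(doc, stringsAsFactors = FALSE)

# Every column arrives as text, so convert the ones used for arithmetic or dates.
employees$SALARY <- as.numeric(employees$SALARY)
employees$STARTDATE <- as.Date(employees$STARTDATE, format = "%m/%d/%Y")

# Select the IT department and compute its mean salary.
it_staff <- subset(employees, DEPT == "IT")
it_mean_salary <- mean(it_staff$SALARY)
print(it_staff)
print(it_mean_salary)
```

Converting STARTDATE with an explicit `format` avoids relying on locale defaults, since the sample dates are written month/day/year.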
