友情提示:如果本网页打开太慢或显示不完整,请尝试鼠标右键“刷新”本网页!
第三电子书 返回本书目录 加入书签 我的书架 我的书签 TXT全本下载 『收藏到我的浏览器』

VB2008从入门到精通(PDF格式英文版)-第69部分

快捷操作: 按键盘上方向键 ← 或 → 可快速上下翻页 按键盘上的 Enter 键可回到本书目录页 按键盘上方向键 ↑ 可回到本页顶部! 如果本书没有阅读完,想下次继续接着阅读,可使用上方 "收藏到我的浏览器" 功能 和 "加入书签" 功能!


    Implements IProcessor。Process 

        Dim reader As TextReader = New StringReader(input) 

        Dim retval As New StringBuilder() 


…………………………………………………………Page 286……………………………………………………………

264       CH AP T E R   1 0   ■    L E A R N I N G   A B OU T   P E R S IS TE N CE 



                   Do While reader。Peek()  …1 

                       Dim splitUpText As String() = _ 

                         reader。ReadLine。Split(New Char() {〃 〃c; ControlChars。Tab}) 

                       Dim c1 As Integer 

                       For c1 = 0 To splitUpText。Length 1 

                           retval。Append((〃(〃 & splitUpText(c1) & 〃)〃)) 

                       Next 

                       retval。Append(ControlChars。NewLine) 

                   Loop 

                   Return retval。ToString() 

               End Function 

           End Class 



                In the implementation of Process(); the text will be parsed line by line。 Then each line  

           is split into the individual fields。 You could write the parsing routines yourself; but to parse a  

           buffer line by line; it is more efficient to use StringReader。 StringReader accepts the string to  

           parse and is then assigned to a TextReader interface instance。 

                As each line of text is parsed; the most efficient approach to building a buffer is to use  

           StringBuilder。 You could keep appending data to the string; but if you do that too often the  

           application’s performance will suffer。  

                The String type is an immutable type; which means once an object is initialized; you  

           cannot change the state of the object。 The advantage of immutable types is that they increase  

           the speed of your application; because code can assume once an object has been assigned; it  

           will never change。 The downside is that once an object is assigned; to modify the object state  

           even slightly; you must instantiate a new object; which would be the case if we used the = and  

           ± operators。 The  StringBuilder type is like String; except the referenced text can be modified。 

                In the Process() implementation; the Do While loop calls the method Peek(); which reads;  

           but does not remove; a character value from the stream。 If there is nothing more to read; a …1  

           value is returned。 Otherwise; data is available; and the method  ReadLine() can be called。  

           ReadLine() will read a buffer of characters until a newline or return character is encountered。  

           Having read a line of text; it is split into the individual fields using the Split() method。 The split  

           characters are the space and tab character (ControlChars。Tab)。 

                When the Split() method returns; the individual fields are assigned to the array splitUpText。  

           Those array elements are iterated and appended to the StringBuilder variable retval; but each  

           element is surrounded by a set of brackets。 The brackets provide a set of boundaries that you  

           can inspect to see what data has been found。 I include the brackets purely for debugging purposes。  

           Because I am trying to reformat the stream; I append a newline character (ControlChars。NewLine) to  

           the variable retval。 

                When all of the lines of text and fields within the lines of text are iterated; a string represen

           tation of the StringBuilder instance is returned using the ToString() method。 Running the  

           code shows how many fields each line of text has and how you should format the text file。 This  

           gives you an understanding of how the file is structured。  

                The following is sample output from the lotto。txt file。 


…………………………………………………………Page 287……………………………………………………………

                                                        CH A PT E R   1 0   ■    L E A R N I N G   A B O U T  P E R S IS T E N CE 265 



(2000。01。15)(6)(10)(25)(26)(38)(42)(20) 

(2000。01。19)(2)(16)(18)(23)(32)(43)(26) 

(2000。01。22)(4)(5)(6)(24)(34)(38)(9) 

(2000。01。26)(3)(20)(22)(24)(34)(39)(9) 

(2000。01。29)(7)(12)(13)(34)(38)(39)(28) 

(2000。02。02)(1)(18)(22)(28)(35)(43)(32) 

(2000。02。05)(4)(13)(15)(31)(32)(45)(37) 

(2000。02。09)(1)(29)(31)(34)(39)(41)(25) 

。。。 

(2006…12…27)(11)(13)(17)(21)(24)(26)(38)(578199)(735993)()() 

(2006…12…30)(3)(13)(22)(30)(35)(41)(34)(142968)(472679)() 

() 

() 

() 

(2007…01…03)(5)(24)(37)(39)(41)(44)(9)(049802)(133875)()() 

(2007…01…06)(3)(7)(23)(27)(30)(32)(38)(687442)(874814)()() 

(2007…01…10)(7)(9)(13)(23)(35)(37)(25)(039498)(648301)()() 

(2007…01…13)(3)(17)(22)(37)(39)(43)(34)(968842)(162860)()() 

(2007…01…17)(12)(16)(27)(33)(37)(41)(24)(663824)(765917)()() 



     The sample output shows that we have the following items to fix: 



     o  There are empty lines of text where no data has been defined。 



     o  Some lines of text have empty fields at the end。 



     o  Some fields have an incorrect date format。 



     o  Some dates have duplicates; which need to be removed。 



     o  Some lines of text have too many fields。 We need to figure out which fields we want to  

        keep and which we can discard。 



■Note  When processing streams and cleaning them up; it is important to take the stream apart first and  

see what you are up against。 Do not make assumptions until you have looked at the individual pieces of data。  

Then you will be able to determine the steps you need to undertake to fix the stream。 



Fixing the Stream 



The final solution uses the same code used to parse the lines of text and individual fields; as  

follows (note; however; that we need the individual lines if the date format is correct; so we  

store each one in the lineOfText variable): 


…………………………………………………………Page 288……………………………………………………………

266      CH AP T E R   1 0   ■    L E A R N I N G   A B OU T   P E R S IS TE N CE 



          Public Class LottoTicketProcessor : Implements IProcessor 

              Private _dates As IList(Of String) = New List(Of String)() 

              。 。 。 

              Public Function Process(ByVal input As String) As String 

              Implements IProcessor。Process 

                  Dim reader As TextReader = New StringReader(input) 

                  Dim retval As StringBuilder = New StringBuilder() 

                  Do While reader。Peek()  …1 

                      Dim lineOfText As String = reader。ReadLine() 

                      Dim splitUpText As String() = 

                        lineOfText。Split(New Char() {〃 〃c; ControlChars。Tab}) 

                      If _dates。Contains(splitUpText(0)) Then 

                          Continue Do 

                      End If 

                      If splitUpText(0)。Length = 0 Then 

                          Continue Do 

                      End If 

                      If splitUpText(0)。Contains(〃…〃) Then 

                          Dim dateSplit As String() = splitUpText(0)。Split(New Char() {〃…〃c}) 

                          Dim newDate As String = 

                              dateSplit(0) & 〃。〃 & dateSplit(1) & 〃。〃 & dateSplit(2) 

                          If _dates。Contains(newDate) Then 

                              Continue Do 

                          End If 

                          _dates。Add(newDate) 

                          retval。Append(newDate) 

                          For c1 As Integer = 0 To 7 

                              retval。Append(〃 〃 & splitUpText(c1)) 

                          Next 

                      Else 

                          _dates。Add(splitUpText(0)) 

                          retval。Append(lineOfText) 

                      End If 

                      retval。Append(ControlChars。NewLine) 

                  Loop 

                  Return retval。ToString() 

              End Function 

              。 。 。 

          End Class 



          ■Note  In the downloadable source code; the individual steps taken to clean up the data stream are demon

          strated。 For reference; the intermediate development steps in the source code are called Process01()  

          through  Process05()。 



               Let’s review how this code fixes the five problems we discovered。 


…………………………………………………………Page 289……………………………………………………………

                                                        CH A PT E R   1 0   ■    L E A R N I N G   A B O U T  P E R S IS T E N CE 267 



Empty Lines of Text 



The following code removes the empty lines of text。  



        If splitUpText(0)。Length = 0 Then 

            Continue Do 

        End If 



     When  lotto。txt was processed; the output data stream generated a single field array for  

an empty line。 So; we know that if the first field element has a length of zero; the line of text  

should be ignored。 



Empty Fields and Too Many Fields 



The next problem in our list is that some lines have empty text fields at the end。 Solving this  

problem would probably entail a solution similar to the previous one; but you should think of  

the big picture and understand that solving one problem might also solve another problem。 In  

this case; solving the problem of the empty fields also helps solve the problem of having too  

many fields。 

     Both of these problems are solved by knowing the data that is being manipulated。 As you’ve  

seen; the data stream assumes the following format: date; then lottery numbers 1 to 6; and then  

the bonus number。 The parts of the data stream that are not correct have the same format; with  

some extra information like replay number and empty fields。 Thus; the fix is to copy the date  

and append the remaining fields; as follows: 



            retval。Append(newDate) 

            For c1 As Integer = 0 To 7  

                retval。Append(〃 〃 & splitUpText(c1)) 

            Next 



     The first line of code appends the date to the StringBuilder buffer (retval)。 Then in the  

For loop that follows; a space and the fields 0 to 7 are copied to the StringBuilder buffer。 



Incorrect Data Format 



In some of the fields; the date has a period separator; in others; it has a hyphen。 The correct  

format is a period; and the code that fixes the date format is as follows: 



        If splitUpText(0)。Contains(〃…〃) Then 

            Dim dateSplit As String() = splitUpText(0)。Split(New Char() {〃…〃c}) 

            Dim newDate As String = 

                dateSplit(0) & 〃。〃 & dateSplit(1) & 〃。〃 & dateSplit(2) 



     A fix is needed if the first field contains a hyphen。 The  If statement tests for this using the  

Contains() method。 If a fix is needed; the first field is separated again into three subfields; where  

each subfield represents a part of the date (year; month; day)。 Then those three sub
返回目录 上一页 下一页 回到顶部 0 0
快捷操作: 按键盘上方向键 ← 或 → 可快速上下翻页 按键盘上的 Enter 键可回到本书目录页 按键盘上方向键 ↑ 可回到本页顶部!
温馨提示: 温看小说的同时发表评论,说出自己的看法和其它小伙伴们分享也不错哦!发表书评还可以获得积分和经验奖励,认真写原创书评 被采纳为精评可以获得大量金币、积分和经验奖励哦!