This Is AuburnElectronic Theses and Dissertations

Show simple item record

Data Extraction from Servers by the Internet Robot


Metadata FieldValueLanguage
dc.contributor.advisorWilamowski, Bodgan
dc.contributor.authorPham, Nam
dc.date.accessioned2009-07-15T14:21:24Z
dc.date.available2009-07-15T14:21:24Z
dc.date.issued2009-07-15T14:21:24Z
dc.identifier.urihttp://hdl.handle.net/10415/1781
dc.description.abstractData extraction from internet is a way to download and extract the required data automatically from web servers. In this thesis, we present a method called the Internet Robot to extract the data directly from a web server by using Perl scripting language with the powerful regular expressions. The regular expressions are widely used in this method to reduce the complexity of the program code as well as increase up the downloading and extracting speed. The Internet Robot in this thesis is a process of three steps: data collection, data filtering and processing and data presentation. The final result of this process will be the html files- with all required data in the typical format that is presented under different links of a webpage. The accuracy and speed make this method become unique in processing and extracting data not only from the internet but also from an available database.en
dc.rightsEMBARGO_NOT_AUBURNen
dc.subjectElectrical Engineeringen
dc.titleData Extraction from Servers by the Internet Roboten
dc.typethesisen
dc.embargo.lengthNO_RESTRICTIONen_US
dc.embargo.statusNOT_EMBARGOEDen_US

Files in this item

Show simple item record