Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
The Architecture of Internet and WWW Benefits of the Web as a delivery mechanism: •Thin clients •Platform independence •Easy for deployment/upgrade •Based on standards Web Browser HTML documents HTTP TCP/IP Client End User © Minder Chen, 1996-2013 Internet or Intranet Web Server Web Architecture - 1 Internet: The Virtual Network & Internal Structure NAP ISP Computer Router A Regional Network Internet TELCO LAN NAP: National Access Point ISP: Internet Service Provider © Minder Chen, 1996-2013 Web Architecture - 2 Hosts on the Internet • IP Datagram – Datagram Header Total about 4.3 billion IP addresses. • Source: 15.127.9.12 (IP Address) • Destination: 254.231.15.129 • American Registry for Internet Numbers https://www.arin.net/ Router © Minder Chen, 1996-2013 Web Architecture - 3 Domain Name • DNS (Domain Name Server): Mapping the four octets (32-bit numeric) IP address (such as 217.156.3.14) to a domain name such as student.csuci.edu • http://www.godaddy.com for domain name registration The name of a host computer with an IP address CSUCI.EDU edu First Level csuci student uiuc cs vax Top country domain name .us .tv (30M) .md Second Level © Minder Chen, 1996-2013 Web Architecture - 4 Key Elements of Web • Hypertext: Non-linear links to anchors of the same document, or to different documents on the same or different web site. – Using HTML (HyperText Markup Language) links and anchors – Relying on URL (Uniform Resource Locators) addressing scheme • Multimedia: graphics, video, sound, etc. – Web browser can access networked hypermedia. – Hypermedia access of the web browser is facilitated by helper applications, plug-ins, or document viewers. • Network: Global reach and "Universal" access – Based on HTTP which is built on top of the TCP/IP protocol. © Minder Chen, 1996-2013 Web Architecture - 5 Client-Server Model of http 1 helper app helper app helper app helper app helper app Web Site • Requesting a document via a Web Server URL address • Connection open Internet or Intranet Web Client Static 2 Dynamic • Returning MIMEcompliance document • Connection close Web contents • HTML documents • Images • Animation • Video clips • Sound bites • Java applets • Java Scripts • CGI scripts • JSP • Database access • ASP/ASPX/PHP • JSP MIME: Multipurpose Internet Mail Extension © Minder Chen, 1996-2013 Web Architecture - 6 HTTP • HTTP: HyperText Transfer Protocol • Characteristics: – Runs on top of TCP/IP (Transmission Control Protocol/Internet Protocol) – Has a comprehensive addressing scheme, i.e. URL – An extensible and open representation for data type, using MIME header – Has a stateless protocol. There is no memory between client connections. Cookies are used to track users. – Is efficient – Is portable – Possible extensions, Secure HTTP (SHPPT), SSL. (https://URL) © Minder Chen, 1996-2013 Web Architecture - 7 The Extended Framework of WWW HTML Authoring Tools/Editors Web Browser Web Designer & Publisher External Applications Non-HTTP objects Web Programmer • ColdFusion, • CGI (Perl) • ASP & ASP.NET • JAVA Servlet • Java Server Pages • Java Applet • JavaScript Internet Global Reach Broad Range Client End User © Minder Chen, 1996-2013 Web Server Web Master Web Architecture - 8 <html> <head> <title> HTML Sampler Web Page </title> </head> <body> <h1>HTML Sampler</h1> <img src="eyes.gif"><br> <b> List: </b> <ul> <li> Item 1 <li> <font size=+2>Item 2</font> <li> <i>Item 3</i> </ul> <table border=1> <tr> <th> Name</th> <th>Speciality </th> </tr> <tr> <td> Minder Chen</td> <td>MIS </td> </tr> <tr> <td> Justin Chen</td> <td>Aerospace </td> </tr> </table> <p> A sample of a form: <form method="post" action="http://hoohoo.ncsa.uiuc.edu/cgi-bin/post-query" > Name: <input type="text" size=20> <br> <input type="checkbox" name="html" value="YES"> Know HTML <br> <b><input type="submit" value="Submit your data"></b> </form> <hr> Developed by <a href="http://www.erols.com/aitc/"> Advanced IT Consulting</a> </body></html> HTTP (Web) Cookie This is how most web sites used to track you and maintain online sessions. • First party and third party Cookies at http://on.wsj.com/LamyJX © Minder Chen, 1996-2013 Web Architecture - 10 Clickstream • Hit – any Web server request that generates a log file entry. A page has many elements (html, gifs), each generating a hit. • Page – Web server file that is sent to client user agent, usually a browser. • Session – all actions (i.e. requests, resets) made in single visit, from entry until logout or time out (e.g., 20 minutes of no activity). • Visitor – a user or bot/spider/crawler that makes requests at a site. Can be new, returning, registered, anonymous • Buyer – visitor that purchases something • Customer – a visitor that registers (sometimes defined as buyer) • Conversion – rate at which visitors transition to desired state • (buyers, customers, registered, started checkout) • Host – remote machine, identified by IP address, used for visit. • Referrers – page that provides a link to another page. Can be internal or external © Minder Chen, 1996-2013 Web Architecture - 11 Web Log Analysis • • • • • • • • • • • • • • • • Number of visits and number of unique visitors Visits duration and last visits Authenticated users, and last authenticated visits Days of week and rush hours Domains/countries of host's visitors. Hosts list Number total pageviews Most viewed, entry and exit pages Files type OS used Browsers used Robots HTTP referrer Search engines, key phrases and keywords used to find the analyzed web site HTTP errors Some of the log analyzers also report on who's on the site, conversion tracking, visit time and page navigation. © Minder Chen, 1996-2013 Web Architecture - 12