Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
The Latest Web Developments Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: Email [email protected] URL http://www.ukoln.ac.uk/ About Me Brian Kelly: • UK Web Focus – a JISC-funded post to advise HE and FE communities on Web developments • Based in UKOLN (UK Office for Library and Information networking) – a small applied research organisation in University of Bath • Involved in Web since 1993, while working in Computing Service at University of Leeds • Close links with Computing Service and Library communities 2 About You What is your involvement with the Web? What topics would you like covered today? 3 Possible Interests XML Web applications File formats Content Management Systems Hyperlinking Legal issues Interests RDF When is it going to stabilise? What’s happening to HTML? Web Standards Web browsers Netscape or Microsoft? Web Architectures Web Applications 4 Technologies Web Services Open source vs licensed apps Contents • • • • • • • • 5 Standards and the Web The Original Web Architecture The Problems Architectural Developments Metadata New Developments Deployment Issues Discussion Standards, Architectures, Applications, Resources This talk touches on several areas Standards: concerned with protocols and file formats Open standards vs. Proprietary HTML / XML vs. PDF CSS / XSL vs. HTML Applications: software products used to implement systems 6 Apache / IIS FrontPage / Dreamweaver Oracle / SQLServer ColdFusion vs ASP Architectures: models for implementing systems Which standards are applicable NT / Unix File system / database application HTML tools / content management Resources: financial and staff costs needed to implement systems Development vs. Migration costs Use of in-house expertise In-house vs. out-sourced Licensed vs. open source Standards Need for standards to provide: • Platform independence • Application independence • Avoidance of patented technologies • Flexibility ("evolvability" - Tim Berners-Lee) • Architectural integrity • Long-term access to data Ideally look at standards first, then find applications which support the standards Difficult to achieve this ideal! 7 Deployment Issues What part of the spectrum are you closest to? Must support standards 8 Go with the marketplace I Support Standards But: • You probably use PowerPoint, don't you? • Software vendors will subtly suck you into use of proprietary features • Home-grown solutions can be expensive (where are all the good Perl / C programmers willing to work on short-term contracts for a pittance in Universities?) • Standards may not take off – remember Coloured Book network protocols? • Proprietary solutions may become standardised • Standards may not yet be available (or finalised) • Do users want standards? Will "We support standards" conflict with "Our services are based on user requirements"? 9 I Follow The Marketplace Good New Labour philosophy, but: • Can you trust your software vendor? • Will your software vendor be around in a few years time ("I only buy Rover") • Will your system be interoperable? • What happens when you want to interwork with partners or your organisation merges / is taken over? • What happens when you want to extend your system beyond the limits set by your software vendor? IBM was the market leader in the 1970s, but lost out in the PC revolution What will happen if Microsoft is split in two? 10 Some Difficulties We should acknowledge some difficulties in a standards-based approach: • Keeping up-to-date (look at nos. of documents at http://www.w3c.org/TR/ and size of http://www.diffuse.org/standards.html) • • • • 11 Spotting the winning standards Implementing the standard in a timely way Dealing with the problems of the software vendor Resources! Standardisation Other • Standards bodies such as ECMA • Community groups which can agree on, say, profiles HTML Proprietary extensions • De facto standards PDF and Java? • Often initially appealing W3C (cf PowerPoint) PNG • Produces W3C • May emerge as HTML ISO Recommendations standards • Produces ISO Z39.50 on Web protocols Java? Standards • Managed approach to • Can be slow moving developments and bureaucratic • Protocols initially • Produce robust IETF developed by standards W3C members • Produces Internet • Decisions made by Drafts on Internet protocols W3C, influenced by • Bottom-up approach to developments member and public • Protocols developed by HTTP review interested individuals PNG URN • "Rough consensus and working HTML whois++ code" HTTP 12 World Wide Web Consortium Much of the development of Web standards is being coordinated by the W3C: W3C (World Wide Web Consortium): • International consortium, with headquarters at MIT, INRIA and Keio University (Japan) • Coordinates development of web protocols • Four domains: • Architecture • User Interface 13 • Technology & Society • Web Accessibility The Web Vision Tim Berners-Lee's vision for the Web: • Automation of information management: If a decision can be made by machine, it should • All structured data formats should be based on XML • Migrate HTML to XML • All logical assertions to map onto RDF model • All metadata to use RDF A useful overview of Tim Berners-Lee's vision for the Web is given in his book Weaving The Web. 14 How Does The Web Work? The Web has 3 fundamental concepts: • URLs: addresses of resources • HTTP: dialogue between client and server • HTML: format of resources The Netsoft home page Welcome to Netsoft Web Browser (client) 15 1 User clicks on link to the address (URL) http://www.netsoft.com/hello.html 2 Browser converts link to HTTP command (METHOD): Connect to computer at www.netsoft.com GET /hello.html 3 Remote computer sends file <HTML> <TITLE>Welcome</TITLE>.. <P>Welcome to <B>Netsoft </B> 4 Local computer displays HTML file Web server Web Protocols Web initially based on three simple protocols: Data Format HTML Addressing Transport URL HTTP • Data Formats HTML (HyperText Markup Language) provides the data format for native documents • Addressing URLs (Uniform Resource Locator) provides an addressing mechanism for web resources • Transport HTTP (HyperText Transfer Protocol) defines transfer of resources between client and server 16 HTML 4.0, CSS 2.0 & DOM 1.0 HTML 4.0 used in conjunction with CSS 2.0 (Cascading Style Sheets) and DOM 1.0 provides an architecturally pure, yet functionally rich environment HTML 4.0 • Improved forms • Hooks for stylesheets • Hooks for scripting languages • Table enhancements • Better printing CSS Problems • Changes during CSS development • Netscape & IE incompatibilities • Continued use of browsers with known bugs 17 CSS 2.0 • Support for all HTML formatting • Positioning of HTML elements • Multiple media support DOM 1.0 • Document Object Model • Hooks for scripting languages • Permits changes to HTML & CSS properties and content CSS CSS: • Cascading Style Sheets • An open standard developed by W3C • Separates document structure (defined in HTML/XML) from the appearance • Makes maintenance of resources much easier <link rel="style" src="sty.css" <h1>Heading</h1> <p>…</p> Imagine 10,000 HTML files .. 18 http://www.w3c.org/Style/CSS/ body {background: blue;} h1: {font-family: arial} p: {font-family: times; text-align: justify} With 1 CSS file Limitations HTML 4.0 / CSS 2.0 have limitations: • Difficulties in introducing new elements – Time-consuming standardisation process (<ABBREV>) – Dictated by browser vendor (<BLINK>, <MARQUEE>) • Area may be inappropriate for standarisation: – Covers specialist area (maths, music, ...) – Application-specific (<STUD-NUM>) • HTML is a display (output) format • HTML's lack of arbitrary structure limits functionality: 19 – Find all memos copied to John Smith – How many unique tracks on Spice Girls CDs XML XML: • • • • Extensible Markup Language A lightweight SGML designed for network use Addresses HTML's lack of evolvability Arbitrary elements can be defined (<STUDENTNUMBER>, <PART-NO>, etc) • Agreement achieved quickly - XML 1.0 became W3C Recommendation in Feb 1998 • Support from industry (SGML vendors, Microsoft, etc.) • Support in Netscape 6 (?) and IE 5 20 XML Concepts Well-formed XML resources: Make end-tags explicit: <li>...</li> Make empty elements explicit: <img ... /> Quote attributes <img src="logo.gif" height="20" Use consistent upper/lower case Valid XML resources: Need DTD XML Namespaces: Mechanism for ensuring unique XML elements: <?xml:namespace ns="http://foo.org/ 1998-001" prefix="i"> <p>Insert <i:PART>M-471</i:PART></p> 21 More XML Developments Momentum behind XML is driving additional standardisation developments XML Path A language for addressing parts of an XML document, designed to be used by XSLT and XPointer XML Schemas (Ii) Defining the nature of XML schemas and their component parts XSLT A language for transforming XML documents into other XML documents … 22 XHTML XHTML: • Extensible Hypertext Markup Language • HTML represented in XML • Some small changes to HTML: – Elements in lowercase (<p> not <P>) – Attributes must be quoted (<img src="logo" height="50"> – Elements must be closed (< p >..</ p >) – Empty elements must be closed (<img src="logo" . />) 23 • Gain benefits from XML • Tools available (e.g. HTML-Kit from http://www.chami.com/html-kit/) • See <http://www.webreference.com/xml/ column6/>, <http://groups.yahoo.com/ group/XHTML-L/> and <http://www.ariadne.ac.uk/ issue27/web-focus/> Transport HTTP/0.9 and HTTP/1.0: Design flaws and implementation problems HTTP/1.1: Addresses some of these problems 60% server support Performance benefits! (60% packet traffic reduction) Is acting as fire-fighter Not sufficiently flexible or extensible HTTP/NG: 24 Radical redesign using object-oriented technologies Undergoing trials Gradual transition (using proxies) Moving slowly Addressing URLs (e.g. http://www.bristol-poly .ac.uk/depts/music/) have limitations: • Lack of long-term persistency – Organisation changes name – Department shut down or merged – Directory structure reorganised • Inability to support multiple versions of resources (mirroring) URNs (Uniform Resource Names): • Proposed as solution • Difficult to implement (no W3C activity in this area) 25 Addressing - Solutions PURLs (Persistent URLs): • Provide single level of redirection DOIs (Digital Object Identifiers): • Proposed by publishing industry as a solution • Aimed at supporting rights ownership • Business model needed OpenURLs • Address mirroring issues Pragmatic Solution: • URLs don't break - people break them • Design URLs to have long life-span Further information: 26 <URL: http://www.ukoln.ac.uk/metadata/resources/urn/> <URL: http://www.w3.org/Provider/Style/URI> Metadata Metadata - the missing architectural component from the initial implementation of the Web Addressing URL Metadata Needs: 27 • • • • • • • Resource discovery Content filtering Authentication Improved navigation Multiple format support New devices Rights management Transport Data format HTTP HTML Metadata Examples DSig (Digital Signatures initiative): • Key component for providing trust on the web • DSig 2.0 will be based on RDF and will support signed assertion: – This page is from the University of Bath – This page is a legally-binding list of courses provided by the University P3P (Platform for Privacy Preferences): • Developing methods for exchanging Privacy Practices of Web sites and user Note that discussions about additional rights management metadata are currently taking place 28 RDF RDF (Resource Description Framework): • Highlight of WWW 7 conference • Provides a metadata framework ("machine understandable metadata for the web") • Based on ideas from content rating (PICS), resource discovery (Dublin Core) and site mapping (MCF) • Based on a formal data model (direct label graphs) • Applications include: – cataloging resources – electronic commerce – intellectual property rights – resource discovery – intelligent agents – privacy • See <URL: http://www.w3.org/ Talks/1998/0417-WWW7-RDF> 29 RSS – An RDF Application RSS (Rich Site Summary): • Now an RDF application • Used for news feeds • Of interest to JISC (DNER architecture) • Lightweight approach that we should be investigating 30 See example of an RSS authoring tool and parser at <http://rssxpress.ukoln.ac.uk/>. Note this service uses CGI – a JavaScript solution is also being developed. RDF Conclusion RDF is a general-purpose framework RDF provides structured, machine-understandable metadata for the Web Metadata vocabularies can be developed without central coordination RDF Schemas describe the meaning of each property name Signed RDF is the basis for trust But: • Is RDF too complex? • Will it gain acceptance in the market place? The jury is till out 31 Other Web Developments Many Web standards developments are taking place outside W3C: • UDDI (Universal Description, Discovery, and Integration) – a way of describing Web services in a machine readable way to facilitate location of services by agents. See <http://www.uddi.org/> • Biztalk – a framework for developing XML schemas for B2B applications. See <http://www.biztalk.org/> • SOAP (Simple Object Access Protocol) - an XML protocol for exchange of information See <http://www.w3.org/TR/SOAP> 32 New Web Areas Initially the Web provided: • An open environment for • sharing information And aimed to: • provide a rich publishing and collaborative environment The Web is now: • Widely used in closed environments (Intranets and Extranets, for ecommerce, etc.) • Addressing the missing components from the original architecture • Addressing universally by providing the infrastructure for support of new devices 33 E-commerce E-commerce: • Requires trust • Requires security • Is there a viable business model? Developments: • Digital signatures • Public Key Infrastructure • Athens and Sparta in UK HE 34 Example 1 http://www.w3.org/Signature/ The Mobile Web The Mobile Web: • Much hype at present • Have you used it? • Is it usable on such a small screen with slow network times? • What about the resources need to build a WAP site and a Web site 35 Example 2 The Mobile Web Comments Store resources in neutral format (XML) and generate WAP and Web XML: open storage format XSLT: Transform XML Ebook format XML XSLT rules XSLT engine WML file for WAP XHTML for Web 3G promises multimedia and faster networks 36 Is It Worth It? Has the Web stabilised? • Are you thinking about WAP services? • Will you want to (be forced to) make your Web service accessible? • Will you want to deploy personalised interfaces (e.g. My.Oxford.ac.uk) • Will your web service move from information provision to e-business? • Do you want your University web site to use business-to-business (B2B) protocols to automate transfer of link and news items to HERO? 37 What Should I Do? How can I best exploit new developments? • Storing information in a structured format makes subsequent redevelopment easier • Be driven initially by standards and architectural considerations, not by applications • Consider use of more sophisticated web management tools, rather than HTML authoring tools • An organisational standards guidelines document (part of a Web Strategy document) may be useful • Don't work in isolation: – Monitor standards development (e.g. W3C) – Listen to others in your community – Talk and discuss issues within your community 38 Authoring Authoring Web pages: • Was easy • Becoming more difficult as Web becomes more complex • More difficult to maintain For large Web sites there is a need for: • More sophisticated tools e.g. content management systems • Tailoring content for devices? 39 Architectural Models There is a need for more intelligent software which can process structured resources or reformat unstructured ones HTML resource HTML / XML / database resource Web server browser Intelligent Web server Intermediaries can provide functionality not available at client: • DOI support • XML support 40 • Format conversion Web server simply sends file to client File contains redundant information (for old browsers) plus client interrogation support Client proxy browser Server proxy Architectural Models – e.g. XML Deployment Ariadne issue 14 has article on "What Is XML?" Describes how XML support can be provided: • Natively by new browsers • Back end conversion of XML - HTML • Client-side conversion of XML - HTML / CSS • Java rendering of XML Examples of intermediaries 41 See http://www.ariadne.ac.uk/issue15/what-is/ Conclusions To conclude: • The Web will continue to develop • Standards are important • Proprietary solutions are often tempting because: – – – – They are available They are often well-marketed and well-supported They may become standardised Solutions based on standards may not be properly supported by applications • Metadata is big growth area • Intermediaries may have a role to play in deploying standards-based solutions • There is a continual need to keep informed 42 Questions Any questions? 43