alley with sematag 0
There is a great need for the data model supported by the WWW to be extensible. The WWW data model could be roughly described as a text model formalized by HTML which embeds other data or live objects like images, video, Java or OLE objects.
The current trend observed on the WWW is to define a way for extending this model to support more data or live objects [Berners 95]while nothing is done to define how the text model could be extended.
There is no doubt that the large success of the WWW came from the sudden availability of a great amount of world-wide searchable textual information. This search ability was made possible because the information was expressed in a public non-opaque non-binary text format: HTML.
While it is without any doubt necessary to enable the WWW to handle more and more data or live objects, these are often expressed in a non-public or binary format. The proliferation of this kind of data, minimizing, slowly but surely, textual data expressed in HTML could, in the long term, jeapordize the searchable exploitable working scheme based on textual formats which has proved the success of the WWW.
1.1 Tag Proliferation
HTML suffers from its lack of extensibility, and anarchical tag proliferation is in danger of breaking the WWW. A lot of the recent tag proliferation came from the need to offer improved presentation and layout of online documents. It is obvious that HTML, in that area, is far from the sophisticated layout formats which exists in pre-press software such as Adobe PageMaker or Quark XPress.
The purpose of associating improved presentation, or at least adding some flexibility to the current HTML scheme is, in our view, addressed by the associating of style sheets to the HTML tags, and the work of the W3C in that direction (CSS [Lie 95], DSSSL[Bozak 95]) is a solution which should be adopted.
1.2 Semantic tags
Another reason for tag proliferation has been the need to express and use more complex structured information than is found in the HTML structure model.
Documentary data are more clearly identified by semantic tags [Paoli 94]. Semantic tags are used to precisely identify corporate or industry specific information such as motors, product parts, transistors, or other objects which require a very precise description. This is one of the strong points of SGML and there is always a need for mission-critical data to be formalized and stored using such markup.
It is in specialized corporate contexts that we see the greatest need for such markup. It is generally accepted that people in this kind of corporate environment often manipulate the same kind of structured information which describes very precisely the nature of their business or technical data. In the case of an electronic components manufacturer, for example, it would not be uncommon for engineers to work together on the definition of a new electronic device and on the documentary data which could best describe it.












