Wednesday, February 14, 2007

XML vs HTML

HTML (Hyper Text Markup Language) is the language we use to build webpages. If you right click on a webpage and [view source], invariably notepad will open with a page full of gobbledigook but somewhere near the top you'll see the tag html. This is the language of the web and it tells the web-browser WHAT to display.

HTML can also go hand in hand with CSS (Cascading Style Sheets) which tell the web-browser IN WHAT WAY to display the html instructions. For instance, go to the webreality website, right click on any area of white space to [view source], and about 20 lines down find the sentence which reads [div id="navigationcontainer"]. Underneath that you'll notice lots of tags within which read [li]. These are list items and are referring to the navigation links on the website:



However, it is the CSS which tells the website to display these links in a straight horizontal line, with the white dashes in between and in a white font.

So what I'm saying is that there are two main elements to building a website:

  1. HTML - Tells the web-browser what to display
  2. CSS - Tells the web-browser where and how to display

However, HTML has a cousin called XML - like HTML it also identifies elements in the webpage and it uses a very similar syntax to HTML. However XML is much more portable and can be used in many different types of applications. It is for this reason that, with the advent of Web2.0, XML will probably end up replacing HTML altogether.

However at the moment not all browsers support the XML language so for now HTML is still the primary language of the web. If you'd like to understand this more have a look at the video below.


Wednesday, February 07, 2007

Google tip: Why it's useful to search your own website URL

Google has a few tricks up its sleeve to help you see how your website is doing. In the Google searchbar there are a number of operators you can use to bring up information on your website.

Firstly, if you type in site:www.yourdomain.com, Google will bring up every single page within your site that has been trawled and indexed. If it brings up no records, then your URL has not been indexed and needs to be submitted. To do this, go to: http://www.google.com/addurl/. You can also use the [site:] command to find particular pages of your website. For instance, type into the searchbar SEO site:www.webreality.co.uk and Google will find the page(s) within the webreality site that contain information on search engine optimisation (SEO).



One of our clients the other day had inadvertantly managed to delete a section of their website. Rather than rolling back the database we were able to retrieve the content of that page by using the [cache:] command. The query [cache:] will show the version of the web page that Google has in its cache, in other words the last snapshot that was taken during the trawl.

Another useful operator is the [link:] command which will bring up all of the webpages linking to your website. Inbound hyperlinks are effective in improving your search engine rankings, especially if the link comes from meaningful keywords.

But remember, when it comes to search engines, content is a key factor in maintaining and improving upon rankings. To keep the search engines trawling it is important to make sure you update your site regularly with meaningful and useful copy.