Showing posts with label RDFa. Show all posts

Error : The entity “nbsp” was referenced, but not declared

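Error: The entity “nbsp” was referenced, but not declared, while parsing RDFa using the Semargl parser.


Solution: Use &#160; instead of &nbsp;. The named entity &nbsp; is defined by HTML but is not one of the entities predefined in XML, so an XML-based parser rejects it; the numeric character reference &#160; is always accepted.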
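
If you cannot edit the page itself, the same substitution can be applied to the markup before it is handed to the parser. The following is only a minimal sketch: the class name NbspFix, the method replaceNbsp, and the sample markup are made up for illustration and are not part of Semargl.

```java
public class NbspFix {

    // Replace the HTML-only named entity with its numeric character
    // reference, which XML parsers accept without a DTD declaration.
    static String replaceNbsp(String markup) {
        return markup.replace("&nbsp;", "&#160;");
    }

    public static void main(String[] args) {
        // Hypothetical fragment of an RDFa-annotated page
        String page = "<p property=\"dc:title\">Wise&nbsp;Word</p>";
        System.out.println(replaceNbsp(page));
        // prints: <p property="dc:title">Wise&#160;Word</p>
    }
}
```

Other HTML-only entities (for example &copy;) would need the same treatment; this sketch handles only &nbsp;.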

Linked Data for Information Extraction Challenge 2014

The extraction of data from unstructured or semi-structured web sources has been recognized as a suitable way of populating the data web. Since many pages already use embedded structured data, e.g., as RDFa, Microformats, or Microdata, this information can be used to bootstrap and train supervised systems extracting structured data from the web. In this challenge, we want to compare systems using such annotated pages for extracting information from the web.

The challenge will be held at the Linked Data for Information Extraction (LD4IE) workshop 2014, co-located with the International Semantic Web Conference. The authors of the best performing system are awarded a 250 € book voucher, kindly sponsored by Springer.

For detailed information, refer to http://data.dws.informatik.uni-mannheim.de/LD4IE/

Useful annotation tools for your websites

1. Linked Open Vocabularies - http://lov.okfn.org/dataset/lov/
    - Entry point to the growing ecosystem of linked open vocabularies (RDFS or OWL ontologies) used in the Linked Data Cloud.

2. Namespace Lookup for annotation (for RDF, RDFa) - http://prefix.cc/
    - Look up namespaces for vocabularies (ontologies)

FOAF and SIOC applications



This was a great SlideShare presentation on FOAF & SIOC applications. Unfortunately, many of the links to the applications are down. It would be great if applications developed during a research project could last a long time and have a more meaningful effect on industry, but apparently this is not trivial work. Still, many commercialized products have come out of research at DERI as well, for example Sindice and Seevl.

RDFa Parsing for Website (using Semargl API)

1. Create a Maven project.

2. Add the Semargl dependencies (refer to the Semargl website for details).


3. Test it against your website. (Change "http://localhost:8888/WiseWord/index.php" to your URL.)

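import org.semarglproject.rdf.ParseException;
import org.semarglproject.rdf.TurtleSerializer;
import org.semarglproject.rdf.rdfa.RdfaParser;
import org.semarglproject.sink.CharOutputSink;
import org.semarglproject.source.StreamProcessor;

public class Tester {
    public static void main(String[] args) throws ParseException {
        // Pipeline: RDFa parser -> Turtle serializer -> character sink
        CharOutputSink outputSink = new CharOutputSink();
        StreamProcessor sp = new StreamProcessor(
                RdfaParser.connect(TurtleSerializer.connect(outputSink)));
        outputSink.connect(System.out);  // print the Turtle to stdout
        sp.process("http://localhost:8888/WiseWord/index.php");
    }
}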

RDFa Validation, Crawling for Websites

I tested RDFa validation with RDFa Validator for http://wiseword.tk (which redirects to http://guangyuan.dothome.co.kr).


Validation result....


The source is valid, but when I try to use Apache Any23, extraction fails...


Apache Any23 Extraction Result...


Is this because the supported vocabularies are limited to the ones listed below? If so, this tool could not be used for extracting SIOC data, and neither could LDspider (a crawling framework based on Any23).

RDFa - resource and about


Both attributes are used to refer to resources, and when they are used to set the subject (the context) of a Subject-Predicate-Object triple, they are interchangeable.

The difference between them is that resource can be combined with property to set the object (target) of a triple, which about cannot do.
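
To make this concrete, here is a small sketch that builds three RDFa snippets as Java strings. The class and method names and the example.org IRIs are made up for illustration, and the triples in the comments are what I would expect from an RDFa 1.1 processor, not output captured from a real run.

```java
public class AboutVsResource {

    static final String PREFIX = "prefix=\"foaf: http://xmlns.com/foaf/0.1/\"";

    // Subject position: attr may be "about" or "resource".
    // Expected triple in both cases:
    //   <http://example.org/alice> foaf:name "Alice" .
    static String subjectMarkup(String attr) {
        return "<div " + PREFIX + " " + attr + "=\"http://example.org/alice\">"
             + "<span property=\"foaf:name\">Alice</span></div>";
    }

    // Object position: resource together with property gives an IRI object.
    // Expected triple:
    //   <http://example.org/alice> foaf:knows <http://example.org/bob> .
    static String objectMarkup() {
        return "<div " + PREFIX + " about=\"http://example.org/alice\">"
             + "<span property=\"foaf:knows\" resource=\"http://example.org/bob\">Bob</span></div>";
    }

    // Same element with about instead of resource: about resets the subject,
    // so the object falls back to the text content.
    // Expected triple:
    //   <http://example.org/bob> foaf:knows "Bob" .
    static String aboutWithProperty() {
        return "<div " + PREFIX + " about=\"http://example.org/alice\">"
             + "<span property=\"foaf:knows\" about=\"http://example.org/bob\">Bob</span></div>";
    }

    public static void main(String[] args) {
        System.out.println(subjectMarkup("about"));
        System.out.println(subjectMarkup("resource"));
        System.out.println(objectMarkup());
        System.out.println(aboutWithProperty());
    }
}
```

Pasting each printed snippet into an RDFa validator or distiller is an easy way to check the triples against your own processor.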