Publication:
Comparative study between regular expression and google similarity index for instance based schema matching

dc.contributor.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#en_US
dc.contributor.authorAlzeber, Mogaheden_US
dc.date.accessioned2024-10-08T07:42:17Z
dc.date.available2024-10-08T07:42:17Z
dc.date.issued2016
dc.description.abstractSchema matching is considered as one of the essential phases of database integration. The aim of the schema matching process is to identify the correlation between Schemas which help later in the data integration process. The main issue concern during schema matching is how to support the merging decision by providing the correspondence between attributes through syntactic and semantic heterogeneous in data sources. There have been a lot of attempts in the literature toward utilizing database instances to detect the correspondence between attributes during schema matching process. Many schema matching approaches based on instances have been proposed aiming at improving the accuracy of the matching process. We observed that no single technique managed to provide accurate matching for different types of data. In other words, some of the techniques treat numeric values as strings. This will negatively influence the process of discovering the match and further on the quality of match results. Similarly, other techniques treat textual instance, as numeric, and this will also impact the quality of the match result. Thus, a comparative study between syntactic and semantic techniques is needed. The study should emphasize on analyzing these techniques deeply in order to determine the strengths and weaknesses of each technique. This thesis aims at developing two schema matching techniques, namely: (i) regular expression and (ii) Google similarity to identify the match between attributes for numeric, alphabetic and mix instances. Furthermore, comparing these techniques and evaluate their performance empirically. Several analyses have been conducted on real and synthetic datasets to evaluate the performance of the schema matching techniques considered in this thesis with respect to Precision (P), Recall (R) and F-Measure.en_US
dc.description.callnumbert TK 5105.88815 A478C 2016en_US
dc.description.degreelevelMaster
dc.description.identifierThesis : Comparative study between regular expression and google similarity index for instance based schema matching /by Mogahed Alzeberen_US
dc.description.identityt11100353613MogahedAlzeberen_US
dc.description.kulliyahKulliyyah of Information and Communication Technologyen_US
dc.description.notesThesis (MIT)--International Islamic University Malaysia, 2016.en_US
dc.description.physicaldescriptionxi, 121 leaves :ill. ;30cm.en_US
dc.description.programmeMaster of Information Technologyen_US
dc.identifier.urihttps://studentrepo.iium.edu.my/handle/123456789/9593
dc.identifier.urlhttps://lib.iium.edu.my/mom/services/mom/document/getFile/9rAhhnLusfNyowN8VPEfprB5kZi1BxI820170405111509423
dc.language.isoenen_US
dc.publisherGombak, Selangor : International Islamic University Malaysia, 2016en_US
dc.rightsCopyright International Islamic University Malaysia
dc.subject.lcshOntologies (Information retrieval)en_US
dc.subject.lcshArtificial intelligence--Data processingen_US
dc.subject.lcshSemantic integration (Computer systems)en_US
dc.titleComparative study between regular expression and google similarity index for instance based schema matchingen_US
dc.typeMaster Thesisen_US
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
t11100353613MogahedAlzeber_SEC_24.pdf
Size:
809.48 KB
Format:
Adobe Portable Document Format
Description:
24 pages file
Loading...
Thumbnail Image
Name:
t11100353613MogahedAlzeber_SEC.pdf
Size:
2.03 MB
Format:
Adobe Portable Document Format
Description:
full text secured file

Collections