View Full Version : A Hebrew Tree Bank Based on Cantillation Marks

01-31-2010, 12:45 AM
What a cool idea!:cool:

A Hebrew Tree Bank Based on Cantillation Marks

In the Masoretic text of the Hebrew Bible (HB), the cantillation marks function like a punctuation system that shows the division and
subdivision of each verse, forming a tree structure which is similar to the prosodic tree in modern linguistics. However, in the
Masoretic text, the structure is hidden in a complicated set of diacritic symbols and the rich information is accessible only to a few
trained scholars. In order to make the structural information available to the general public and to automatic processing by the
computer, we built a tree bank where the hierarchical structure of each HB verse is explicitly represented in XML format. We coded
the punctuation system in a context-tree grammar which was then used by a CYK parser to automatically generate trees for the whole
HB. The results show that (1) the CFG correctly encoded the annotation rules and (2) the annotation done by the Masoretes is highly
http://www.cs.brandeis.edu/~marc/misc/proceedings/lrec-2006/pdf/6_pdf.pdf (http://www.cs.brandeis.edu/%7Emarc/misc/proceedings/lrec-2006/pdf/6_pdf.pdf)

02-14-2010, 10:56 PM
Hi,I'm new here and really like this forum, it looks like it has a lot of great info.It'll be nice to get to know everyone on here.Thanks.Have a nice time ahead.

Yes, by all means. You, too, have a nice time ahead!