part 6 paragraph


SUBMITTED BY: Guest

DATE: June 1, 2013, 2:50 a.m.

FORMAT: Text only

SIZE: 1.6 kB

HITS: 9923

  1. --6--
  2. adjective in I00 occurrences in a text, the probability of
  3. havipg an adjective as a dependent is 0.75. The zeroes and
  4. ones in Table I are constant for all words in the glossary.
  5. These values are not listed in the sets of probability
  6. values for the entrles of the glossary; however, they are
  7. known to the system. For instance, the set of probability
  8. values for a transitive verb will contain PI' P2' and P3"
  9. The probability I of governing a noun as object will not
  10. be listed in the data.
  11. The second type of co--occurrence data accompanying
  12. every word in the glossary is a list of possible dependents.
  13. The list is specified in terms of word numbers and semantic
  14. classes (to be described later). It contains the words that
  15. actually appear in the processed physics text as dependents
  16. of the word with which the list is associated. Since the
  17. lists of dependents are compiled on the basis of word co-
  18. occurrence in the text, legitimate word combinations are
  19. guaranteed. In the list of dependents for a verb~ those
  20. words which can only be the subject are marked "S" and
  21. those which can only be the direct object are marked "0".
  22. The co--occurrence data can be regarded as either
  23. syntactic or semantic. They are distinguished here from
  24. both the dependency rules and part of speech designation,
  25. and from the semantic classes that have been established.
  26. At present, seventy--four semantic classes have been set up.
  27. Some of these are formed distributionally (i.e., on the

comments powered by Disqus