Quote Originally Posted by acheung View Post
Thank you very much for your kind help. For now, I found an approximate method to exclude proper nouns by searching for them in morphological assistant:
1. type wtm [enter] in command line
2. l isa
3. go to morphological assistant, select POS: noun, type: Proper name, lemma: *, which results in code @np--*, then Lookup and yield 191 forms, 1239 hits
4. I can subtract the number of forms to get the number of unique words minus proper nouns.
5. Since there are some high frequency proper nouns, if I set the frequency low, it will result in considerable double counting, so I will have to look at the exported word list and make a rough estimate for adjustment. The result is not exact, but reasonably good for my purpose.

Alex Cheung
Hello Alex,

I'm doing this with New Testament texts, so the the same thing in a Hebrew text may yield results that are a little different. Here's what I've done to build a list from Romans, excluding proper names from a search:

1. In the context tab, right click on "Export list to Word list Manager"; this will send the list to the "Main word list" (i.e., left-hand column).
2. With the Word List Manager (WLM) still open, select BGM as your search text and limit your search to Romans. Then type in the command line the following: .*@n???p (The "p" at the end is for "proper name"). Type enter. This will give you a list of all proper names in Romans.
3. In the WLM, select "Secondary word list", then "load or create word list"
4. In the new window that opens, select BGM as your search version, then "load highlighted words from last query" (as said in a previous post, I take the precaution of selecting "use search window limits"). Then deselect "keep Greek accents and Hebrew vowel points". This seems to be necessary, as the list created from the context tab doesn't have accents (I realized that the hard way! More on that below). Then create list.
5. In the WLM, you now have all the words in Romans in your Main word list, and all the proper names in Romans in your Secondary word list.
6. Select "Main Word List" (radial button), then click on "select" => "select words common to both lists"
7. Then "Edit" => "delete selected." That will give you your list of all the words in Romans in the "Main Word list", minus all the proper names.
8. You can then select the words that occur more than 50 x, and delete them from the list.

A couple things: in between steps 2 and 3, BW will automatically send the results of your query on proper names to the WLM, in the Secondary word list. This will not help, though, because it sends the words with their accents, and the WLM doesn't seem to be able to compare the two lists (i.e., Παυλος and Παῦλος, for instance, are seen as two different words). So you'll have to discount that automatically generated list and proceed to step 3.

One other thing: Did you know that you can also create a lexicon for that list? This can be helpful for learning the rarer words. Go to "File" => "make lexicon from selected words", then follow the steps there.

I'd be interested in hearing if this works as well in Hebrew texts (WTM). Please let me know!

Donald Cobb
Aix-en-Provence, France