returns a vector with words of a language. The intent behind it is to test regex patterns

all_words(lang)