In some languages, two or more proper names can be joined by a lowercase connector (for example, "da" in Portuguese). This function returns the lowercase connectors allowed for a given language, for use when building a regex pattern.
connectors(lang = "pt")
connectors("es")
#> [1] "del"
connectors("pt")
#> [1] "da" "das" "de" "do" "dos"
connectors("port")
#> [1] "da" "das" "de" "do" "dos"
connectors("en")
#> [1] "of" "of the"
connectors("misc")
#> [1] "of" "the" "of the" "von" "van" "del"
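The connectors can be combined into a regex that matches proper names joined by them. The following is only a minimal sketch, not the package's own implementation: the vector `pt_connectors` is copied by hand from the `connectors("pt")` output above so the example runs without the package, and the names `cap_word`, `pattern`, and `matches` are hypothetical. The definition of a capitalized word is deliberately simplified to ASCII letters.

```r
# Connectors copied from the documented output of connectors("pt")
pt_connectors <- c("da", "das", "de", "do", "dos")

# Build an alternation, longest connectors first so "das" is tried before "da"
ordered <- pt_connectors[order(-nchar(pt_connectors))]
connector_alt <- paste(ordered, collapse = "|")

# Simplified assumption: a proper-name token is one ASCII capital
# followed by one or more ASCII lowercase letters (no accents handled)
cap_word <- "[A-Z][a-z]+"

# One capitalized word, then any number of further capitalized words,
# each optionally preceded by a lowercase connector
pattern <- paste0(
  cap_word,
  "(?: (?:(?:", connector_alt, ") )?", cap_word, ")*"
)

text <- "A professora Maria da Silva visitou o Rio de Janeiro."
matches <- regmatches(text, gregexpr(pattern, text, perl = TRUE))[[1]]
matches
#> [1] "Maria da Silva" "Rio de Janeiro"
```

Sorting the connectors by length before joining them is a defensive choice: with an alternation, the regex engine tries the options left to right, so listing longer connectors first avoids relying on backtracking to tell "da" from "das".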