You could use table(), for example:
n_occur <- data.frame(table(vocabulary$id))
gives you a data frame with one row per distinct id and the number of times it occurred. The columns are named Var1 (the id, stored as a factor) and Freq (the count).
n_occur[n_occur$Freq > 1,]
tells you which ids occurred more than once.
vocabulary[vocabulary$id %in% n_occur$Var1[n_occur$Freq > 1],]
returns every record whose id occurs more than once.
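To see the steps end to end, here is a minimal sketch on a made-up data frame. The `word` column and the sample values are assumptions for illustration; only `vocabulary` and `id` come from the code above. The last step uses `duplicated()` as a cross-check, which is a different base R approach and not part of the table() method itself.

```r
# Hypothetical toy data: only the names `vocabulary` and `id` come from the answer.
vocabulary <- data.frame(
  id   = c(1, 2, 2, 3, 4, 4, 4),
  word = c("a", "b", "c", "d", "e", "f", "g")
)

# Count how often each id occurs (columns: Var1 = id, Freq = count).
n_occur <- data.frame(table(vocabulary$id))

# ids that occur more than once (here: 2 and 4, as factor levels).
dup_ids <- n_occur$Var1[n_occur$Freq > 1]

# All records whose id occurs more than once (rows 2, 3, 5, 6, 7).
dups <- vocabulary[vocabulary$id %in% dup_ids, ]

# Cross-check with duplicated(), scanning from both ends so that the
# first occurrence of each repeated id is flagged as well.
alt <- vocabulary[duplicated(vocabulary$id) |
                  duplicated(vocabulary$id, fromLast = TRUE), ]
```

Both `dups` and `alt` should hold the same five rows. Note that `%in%` compares the factor `Var1` with the numeric `id` by converting both to character, which is why the comparison works without an explicit conversion.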