q***@gmail.com
2018-11-28 09:50:56 UTC
Is there any way to filter the vocabulary based on the document frequency
of the words?
Like we have the ability in the `*Dictionary*` object where we call `
*filter_extremes*` to do it.
Doc2Vec does have the `*min_count*` parameter, which i think represents the
term frequency. additionally `*trim_rule*` is there, which i think can be a
way but may have some performance issues .
Please suggest if `*Dictionary*` object can be passed to *Doc2Vec *for
building vocabulary or are there any other methods
of the words?
Like we have the ability in the `*Dictionary*` object where we call `
*filter_extremes*` to do it.
Doc2Vec does have the `*min_count*` parameter, which i think represents the
term frequency. additionally `*trim_rule*` is there, which i think can be a
way but may have some performance issues .
Please suggest if `*Dictionary*` object can be passed to *Doc2Vec *for
building vocabulary or are there any other methods
--
You received this message because you are subscribed to the Google Groups "Gensim" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gensim+***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
You received this message because you are subscribed to the Google Groups "Gensim" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gensim+***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.