Alex H.
2018-11-13 00:36:19 UTC
As I mentioned in my previous post, I am trying to build a Doc2Vec model
from a set of emails. Each document has at least one tag: every email is
tagged with its own unique email ID. In addition, a large share of the
emails have a second tag, the sender's email address. The idea behind the
sender tag is that the algorithm might capture patterns characteristic of
specific senders. However, a subset of the emails lack the sender tag
because of missing metadata. What does the algorithm do with documents
that don't have the second tag? Thanks!
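
To make the setup concrete, here is a minimal sketch of the tagging scheme
described above. It is not my actual code: the helper tag_email, the sample
emails, and the "id:" / "sender:" prefixes are all made up for illustration.
The only gensim name used is TaggedDocument (a words/tags pair), and the
sketch falls back to an equivalent namedtuple if gensim is not installed.

```python
from collections import namedtuple

try:
    from gensim.models.doc2vec import TaggedDocument
except ImportError:
    # Stand-in with the same two fields, so the sketch runs without gensim.
    TaggedDocument = namedtuple("TaggedDocument", ["words", "tags"])


def tag_email(email_id, tokens, sender=None):
    """Build one tagged document (hypothetical helper).

    Every email gets its unique ID as a tag; the sender tag is added
    only when the sender metadata is available.
    """
    tags = [f"id:{email_id}"]  # prefix is an assumption, keeps IDs and senders apart
    if sender:
        tags.append(f"sender:{sender}")
    return TaggedDocument(words=tokens, tags=tags)


# Made-up sample data: the second email has no sender metadata.
emails = [
    {"id": "0001", "sender": "alice@example.com",
     "tokens": ["quarterly", "report", "attached"]},
    {"id": "0002", "sender": None,
     "tokens": ["lunch", "on", "friday"]},
]

corpus = [tag_email(e["id"], e["tokens"], e["sender"]) for e in emails]
```

So the first document carries two tags and the second carries only its ID
tag, which is exactly the case I am asking about.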
--
You received this message because you are subscribed to the Google Groups "Gensim" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gensim+***@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.