Ask Dave Taylor
  • Facebook
  • Instagram
  • Linkedin
  • Pinterest
  • Twitter
  • YouTube
  • Home
  • YouTube Videos
  • Top Categories
  • Subscribe via Email
  • Ask A Question
  • Meet Dave
  • Home
  • Computer & Internet Basics
  • Can the FBI Review 650,000 Emails in Just a Few Weeks?

Can the FBI Review 650,000 Emails in Just a Few Weeks?

November 7, 2016 / Dave Taylor / Computer & Internet Basics / 12 Comments

I’m skeptical! The FBI found 650,000 emails on a server and in two weeks managed to go through them all and decide there was nothing damning? How could someone possibly review all those messages in so little time?

In early October 2016, FBI agents discovered 650,000 emails on a computer owned by Anthony Weiner, the husband of top Clinton aide Huma Abedin.  A week or so later FBI Director James Comey announced they were investigating the messages to see if there were breaches of confidentiality related to Hillary Clinton’s private email server from when she was US Secretary of State. November 6th the FBI announced the investigation was complete and exonerated Clinton of  any wrongdoing.

In response, Donald Trump insisted at a rally on Nov 6 that it would have been impossible for the FBI to review what has been reported to be as many as 650,000 emails in such a short time.

That’s what you’re asking about, and it’s a legitimate question. But the FBI completing this task in the allotted time is not only entirely believable, it’s likely that the primary research and investigation was done quite a bit faster. How? Because researching 650,000 email messages doesn’t mean each and every one has to be read word-for-word. That’s why we have search engines, and they’re incredibly fast at finding needles in haystacks.

To demonstrate, I’ll turn to my own Gmail archive, which contains over 100,000 email messages. To get to it, I simply clicked on “All Mail”:

gmail, all mail, menu

Gmail shows me that my email archive actually contains 112,092 messages:

112,092 email messages

That’s not 650,000 email messages, but it’s about 1/6th of the Weiner email archive. Now, how fast can I search for a specific name? Let’s try Obama:

gmail search for obama

Before I’ve even finished typing in the word Google’s Gmail service is searching the archive and showing me potential matches, including two messages from February, 2013.

Once I press Enter, Google completes the search:

gmail search, 112,000 messages, for obama

There are matches from mailing lists I’m on that can and should be removed from the results. No problem, I can get rid of those by using what Gmail calls stop words, words that are prefaced by a dash.

gmail search with stop words, 112,000 messages obama clinton

See the numeric result above? In just a few seconds of work, I weeded through 112,000 email messages, those sent and those received, to isolate the 128 that reference Obama.

But the Weiner email archive likely encompasses much more than just the date range under investigation, so that’s a fast way to limit the search results too. Gmail has that covered too: Let’s take the exact same search as above, but isolating matches to only those sent or received between Jan 1, 2015 and July 4, 2015.

That’s done by adding after:2015/1/1 before:2015/7/4 to the search string. That chops things down so efficiently that I don’t even need the stop words used above:

gmail email search with date range obama

Now, if I can do this and find the 44 matching messages in an archive of 112,000 email messages in less than 30 seconds, do you think it’s possible that the FBI has better tools, faster computers and highly trained agents with a dataset of hundreds of key words, names, agencies, and even nicknames and code names, and apply that to the 650,000 email messages from Weiner’s server?

If the intention was for every single email message to be read and analyzed, that could be done in a few weeks by the FBI too, actually: The Federal Bureau of Investigation has “nearly 35,000 employees, including special agents and support professionals such as intelligence analysts, language specialists, scientists, and information technology specialists.”

Let’s say that only 10% of them are trained to do this sort of research, that still divides 650,000 messages across 3,500 employees, which works out a modest 185 messages per agent. To review in three weeks. If they just do it on lunch break for those three weeks that’s a mere 12 messages per day they need to read and review before the entire dataset is reviewed. That’s few enough that it wouldn’t even adversely affect their lunch plans.

To be fair, if you’re thinking that the research team is just a couple of agents who are manually printing out, reading, analyzing and cross-checking 650,000 email messages, then you’re right to be skeptical. That’s a daunting task. But that’s not how it works and to this tech expert, at least, three weeks to review and analyze the entire dataset is entirely believable.

Addendum: Turns out that the whole problem was quickly reduced down to a much smaller dataset anyway. As the New York Times reported: “law enforcement officials said, there was no need to review all of the emails, only Ms. Abedin’s. Those emails numbered in the thousands, and even many of those were duplicates of messages that had been looked at previously”.

Let’s Stay In Touch!

Never miss a single article, review or tutorial here on AskDaveTaylor, sign up for my fun weekly newsletter!
Name: 
Your email address:*
Please enter all required fields
Correct invalid entries
No spam, ever. Promise. Powered by FeedBlitz
Please choose a color:
Starbucks coffee cup I do have a lot to say, and questions of my own for that matter, but first I'd like to say thank you, Dave, for all your helpful information by buying you a cup of coffee!
analyze email, anthony weiner, email server, fbi, fbi investigation, gmail, gmail search, hillary clinton

12 comments on “Can the FBI Review 650,000 Emails in Just a Few Weeks?”

  1. William says:
    November 23, 2016 at 4:41 pm

    Regardless of all that timing, the LAW does not have any allowance for INTENT. The mere act of DOING it was a felony without any need of intent. And even if they felt the need for intent, there were several instances of spoken and written intent to avoid transparency from FOIA requests.

    Reply
  2. Peter says:
    November 13, 2016 at 8:12 pm

    Well the only thread they would be looking for would be the ones that came out of her private email server and not the government server. So that alone would reduce the number of emails to search. Anything out of the government server would be ignored as it went through proper channels irrespective of the information they contained.

    Reply
  3. Joseph says:
    November 13, 2016 at 3:09 pm

    I believe a lot more people are involved and that is why FBI killed the investigation.

    Reply
  4. R C says:
    November 13, 2016 at 2:40 pm

    Scan 30k pages through OCR to text files maybe 1 day, 1-1/2 years to evaluate as you described above?! Still sounds like they stalled for 1-1/2 years!

    Reply
  5. R Chance says:
    November 13, 2016 at 2:36 pm

    How long would it take them rto copy the pages, OCR them to files and do the same thing you said above? One day at most to convert to files, 1-1/2 years to analyze?!!

    Reply
  6. JeffreyW says:
    November 12, 2016 at 12:03 pm

    And if an email is written in code, for example the search is for evidence of drug usage. “I am going to wear the red dress tonight.” That means the cocaine will be delivered tonight at a certain building associated with “red.” The key word search is only good for obvious phrasing.

    Reply
    • Dave Taylor says:
      November 13, 2016 at 7:55 am

      Do you really think the professionals at the FBI don’t know that, Jeffrey? And don’t have a way of figuring out code phrases and such?

      Reply
  7. Frank says:
    November 12, 2016 at 9:36 am

    Sooo what you have shown is that the FBI was stalling for 1-1/2 years with the original 30,000+ emails!

    Reply
    • Dave Taylor says:
      November 12, 2016 at 9:47 am

      Not necessarily, Frank. I have read that the original 30,000 emails were in printed form. If that’s true, then it would obviously be quite a bit more difficult to analyze!

      Reply
      • tim says:
        November 13, 2016 at 2:39 pm

        Excellent. With the 35,000 employees sited that’s less than a page a piece for 1 1/2 years. Just sayin’

        Reply
      • Jeffersonian says:
        November 13, 2016 at 11:07 pm

        Before they were in printed form, they were in electronic form.
        Point is that they were still stalling, no excuse.

        Reply
      • Jeffersonian says:
        November 13, 2016 at 11:20 pm

        Either they are brilliant and effective, or they are ineffectual and dolts.
        Come on guy, make up your mind.
        See it for what it is, they sat on it for 1 1/2 years.
        The only logical reason for the delay is that they were being stonewalled,
        and the only way to get around it was to go public like they did.

        Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

Recent Posts

  • How to Change Closed Captions Font, Size, and Color in YouTube?
  • Can I Identify Plants and Flowers with the iPhone Camera?
  • HeyFriday.AI Can Help You With Writing Projects from SEO to Blogging
  • How Can I Tell What Version of Windows 11 is Installed and if it’s Activated?
  • Can CSS Make the Quoted Passages on my Web Pages Look Great?

On Our YouTube Channel

EZQuest Compact 3-Port 90W USB-C Wall Charger -- REVIEW

PADMATE PaMu Slide 2 ANC Earbuds -- UNBOXING & PREVIEW

Categories

  • AdSense, AdWords, and PPC Help (106)
  • Amazon, eBay, and Online Shopping Help, (161)
  • Android Help (196)
  • Apple iPad Help (144)
  • Apple Watch Help (52)
  • Articles, Tutorials, and Reviews (344)
  • Business Advice (199)
  • Chrome OS Help (23)
  • Computer & Internet Basics (759)
  • d) None of the Above (164)
  • Facebook Help (382)
  • Google, Chrome & Gmail Help (177)
  • HTML & Web Page Design (245)
  • Instagram Help (47)
  • iPhone & iOS Help (604)
  • iPod & MP3 Player Help (173)
  • Kindle & Nook Help (92)
  • LinkedIn Help (85)
  • Linux Help (163)
  • Linux Shell Script Programming (86)
  • Mac & MacOS Help (888)
  • Most Popular (16)
  • Outlook & Office 365 Help (24)
  • PayPal Help (69)
  • Pinterest Help (53)
  • Reddit Help (17)
  • SEO & Marketing (81)
  • Spam, Scams & Security (92)
  • Trade Show News & Updates (23)
  • Twitter Help (217)
  • Video Game Tips (66)
  • Web Site Traffic Tips (62)
  • Windows PC Help (913)
  • Wordpress Help (204)
  • Writing and Publishing (72)
  • YouTube Help (46)
  • YouTube Video Reviews (159)
  • Zoom, Skype & Video Chat Help (56)

Archives

Social Connections:

Ask Dave Taylor


Follow Me on Pinterest
Follow me on Twitter
Follow me on LinkedIn
Follow me on Instagram


AskDaveTaylor on Facebook



microsoft insider mvp


This web site is for the purpose of disseminating information for educational purposes, free of charge, for the benefit of all visitors. We take great care to provide quality information. However, we do not guarantee, and accept no legal liability whatsoever arising from or connected to, the accuracy, reliability, currency or completeness of any material contained on this site or on any linked site. Further, please note that by submitting a question or comment you're agreeing to our terms of service, which are: you relinquish any subsequent rights of ownership to your material by submitting it on this site. Our lawyer says "Thanks for your cooperation."
© 2022 by Dave Taylor. "Ask Dave Taylor®" is a registered trademark of Intuitive Systems, LLC.
Privacy Policy - Terms and Conditions - Accessibility Policy