
Getting Structured Data from the Internet

Running Web Crawlers/Scrapers on a Big Data Production Scale

Jay M. Patel

Paperback | English
  • Available, delivery time is 10-15 working days
  • Not in stock in our shop
€65.95
  • Free shipping on orders from €15
  • 30 days to change your mind and return physical products

Description

Jay M. Patel is a software developer with over 10 years of experience in data mining, web crawling/scraping, machine learning, and natural language processing (NLP) projects. He is a co-founder and principal data scientist of Specrom Analytics, providing content, email, social marketing, and social listening products and services using web crawling/scraping and advanced text mining.

Jay worked at the US Environmental Protection Agency (EPA) for five years, where he designed workflows to crawl and extract useful insights from hundreds of thousands of documents that were part of regulatory filings from companies. He also led one of the first research teams within the agency to use Apache Spark-based workflows for cheminformatics and bioinformatics applications such as chemical similarities and quantitative structure-activity relationships. He developed recurrent neural networks and more advanced LSTM models in TensorFlow for chemical SMILES generation.

Jay graduated with a bachelor's degree in engineering from the Institute of Chemical Technology, University of Mumbai, India, and a master of science degree from the University of Georgia, USA. Jay serves as an editor of a publication titled Web Data Extraction and also blogs about personal projects, open-source packages, and experiences as a startup founder on his personal site, jaympatel.com.



Specifications

  • Publisher
    APress
  • Pub date
    Nov 2020
  • Theme
    Programming and scripting languages: general
  • Dimensions
    254 x 178 mm
  • EAN
    9781484265758
  • Paperback
    Paperback
  • Language
    English
