Search

Beitao Li Phones & Addresses

  • Weston, FL
  • 4 Wilson Ln, Metuchen, NJ 08840
  • Middlesex, NJ
  • 3138 Riverside Station Blvd, Secaucus, NJ 07094
  • 153 Sandpiper Ky, Secaucus, NJ 07094
  • 6739 El Colegio Rd #120, Goleta, CA 93117
  • Santa Barbara, CA
  • Edison, NJ
  • Piscataway, NJ

Work

Company: Tumblr Nov 2017 Position: Director of data science

Education

Degree: Doctorates, Doctor of Philosophy School / High School: Uc Santa Barbara 1999 to 2003 Specialities: Mining

Skills

Machine Learning • Information Retrieval • Data Mining • Algorithms • Pattern Recognition • C++ • Perl • C • Linux • Search • Distributed Systems • Natural Language Processing • Image Processing • Python • Mapreduce • Hbase • Recommender Systems • Computer Science • Information Extraction • Matlab • Sql • Text Mining • Hadoop • Artificial Intelligence • Text Classification • Text Analytics • R • Java • Ruby • Data Analysis • Solr • Big Data • Scalability • Scala • Git • Lucene • Mongodb • Search Engine Technology • Amazon Ec2 • Seo • C/C++ Stl • Php • Cloud Computing • Sem • Pig • Hive • Scalding • Spark

Languages

Mandarin • English

Interests

E Commerce • Table Tennis • Soccer • Chess • Numbers • History

Industries

Internet

Resumes

Resumes

Beitao Li Photo 1

Director Of Data Science

View page
Location:
201 west 7Th St, Richmond, VA 23219
Industry:
Internet
Work:
Tumblr
Director of Data Science

Tumblr May 1, 2015 - Oct 2017
Senior Engineering Manager

Tumblr Dec 1, 2012 - Apr 2015
Search Engineer

Etsy Jul 2011 - Nov 2012
Principal Software Engineer

Ask.com Aug 2003 - Jun 2011
Engineering Manager and Principal Software Engineer
Education:
Uc Santa Barbara 1999 - 2003
Doctorates, Doctor of Philosophy, Mining
Stanford University 1997 - 1999
Master of Science, Masters
University of Science and Technology of China 1992 - 1997
Bachelors, Bachelor of Science
Skills:
Machine Learning
Information Retrieval
Data Mining
Algorithms
Pattern Recognition
C++
Perl
C
Linux
Search
Distributed Systems
Natural Language Processing
Image Processing
Python
Mapreduce
Hbase
Recommender Systems
Computer Science
Information Extraction
Matlab
Sql
Text Mining
Hadoop
Artificial Intelligence
Text Classification
Text Analytics
R
Java
Ruby
Data Analysis
Solr
Big Data
Scalability
Scala
Git
Lucene
Mongodb
Search Engine Technology
Amazon Ec2
Seo
C/C++ Stl
Php
Cloud Computing
Sem
Pig
Hive
Scalding
Spark
Interests:
E Commerce
Table Tennis
Soccer
Chess
Numbers
History
Languages:
Mandarin
English

Publications

Us Patents

Similarity Detection And Clustering Of Images

View page
US Patent:
7801893, Sep 21, 2010
Filed:
Sep 30, 2005
Appl. No.:
11/242390
Inventors:
Antonino Gulli' - Pisa, IT
Antonio Savona - Sora, IT
Tao Yang - Santa Barbara CA, US
Xin Liu - Piscataway NJ, US
Beitao Li - Piscataway NJ, US
Ankur Choksi - Somerset NJ, US
Filippo Tanganelli - Castiglioncello, IT
Luigi Carnevale - Pisa, IT
Assignee:
IAC Search & Media, Inc. - Oakland CA
International Classification:
G06F 17/30
US Classification:
707737, 707758
Abstract:
A system and method for determining if a set of images in a large collection of images are near duplicates allows for improved management and retrieval of images. Images are processed, image signatures are generated for each image in the set of images, and the generated image signatures are compared. Detecting similarity between images can be used to cluster and rank images.

Dynamic Partial Function In Measurement Of Similarity Of Objects

View page
US Patent:
20030088387, May 8, 2003
Filed:
Sep 24, 2002
Appl. No.:
10/255158
Inventors:
Edward Chang - Santa Barbara CA, US
Beitao Li - Goleta CA, US
International Classification:
H03F001/26
G06F015/00
H04B015/00
US Classification:
702/196000
Abstract:
A method of measuring similarity of a first object represented by first set of feature values to a second object represented by a second set of feature values, comprising determining respective feature distance values between substantially all corresponding feature values of the first and second sets of feature values, selecting a subset of the determined feature distance values in which substantially all feature distance values that are selected to be within the subset are smaller in value than feature distance values that are not selected to be within the subset, and summing the feature distance values in the subset to produce a partial feature distance measure between the first and second objects.
Beitao A Li from Weston, FL, age ~47 Get Report