Overview

Collects agent details from homes.com.

Metadata

Frequency (UTC) Source Dataset ID
12 1 1 1 1 real_estate

Data Dictionary

| Column Name | Description | Type | Mode |
| --- | --- | --- | --- |
| agentKey | unique identifier for the agent | string | NULLABLE |
| hasProPlus | whether the agent has a pro plus account | boolean | NULLABLE |
| reviewCount | the number of reviews the agent has | number | NULLABLE |
| companyName | the name of the agent's company | string | NULLABLE |
| city | the agent's city | string | NULLABLE |
| state | the agent's state | string | REQUIRED |
| zip | the agent's zip code | string | NULLABLE |
| isVerified | whether the agent is verified | boolean | NULLABLE |
| buyerDealCount | the number of buyer deals the agent has | number | NULLABLE |
| sellerDealCount | the number of seller deals the agent has | number | NULLABLE |
| yearsExperience | the number of years the agent has been in the industry | number | NULLABLE |
| closedSales | the number of closed sales the agent has | number | NULLABLE |
| totalValue | the total value of the agent's sales | number | NULLABLE |
| minimumDealValue | the minimum value of the agent's sales | number | NULLABLE |
| maximumDealValue | the maximum value of the agent's sales | number | NULLABLE |
| averageDealValue | the average value of the agent's sales | number | NULLABLE |
| officeKey | the unique identifier for the agent's office | string | NULLABLE |
| isMember | whether the agent is a member | boolean | NULLABLE |
| pageNotFound | whether the agent's page was not found (some pages may be taken down over time) | boolean | NULLABLE |
| dateCollected | The date that the data was collected | timestamp | REQUIRED |
| dateRepresented | The date that the data in this row represents. This is not necessarily the date the data was collected (e.g. pulling from web archives) | timestamp | REQUIRED |
| attemptID | The attemptID of the task that produced this row | string | REQUIRED |
| invocationID | Also known as run ID. The invocationID is associated with a full run written to production | string | REQUIRED |
| taskID | The unique identifier of the task that produced this row - can be from a chain or write task | string | REQUIRED |
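
The Mode column can be checked mechanically. The following is a minimal sketch, assuming each row has been loaded as a plain Python dict keyed by column name. The helper name `missing_required` is hypothetical and not part of the collection pipeline; the column list is copied from the REQUIRED rows of the data dictionary.

```python
# Columns marked REQUIRED in the data dictionary.
REQUIRED_COLUMNS = (
    "state",
    "dateCollected",
    "dateRepresented",
    "attemptID",
    "invocationID",
    "taskID",
)


def missing_required(row):
    """Return the REQUIRED column names that are absent or None in `row`.

    `row` is assumed to be a dict keyed by column name (hypothetical loader).
    """
    return [name for name in REQUIRED_COLUMNS if row.get(name) is None]
```

An empty list means the row satisfies every REQUIRED constraint. NULLABLE columns are not checked.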

Additional Fields

The following columns were removed because they could be classified as PII.

| Column Name | Description | Type | Mode |
| --- | --- | --- | --- |
| stateLicenseNumber | the agent's state license number | string | NULLABLE |
| fullStreetAddress | the agent's full street address | string | NULLABLE |
| agentDetailUrl | the URL of the agent's detail page | string | REQUIRED |

Known Issues

  1. state column - Not all rows store the state as a two-letter uppercase code. For example, almost all California agents have state "CA", but a few have "California", "ca", etc.
  2. companyName column - Company names are likewise not standardized, so the same company may appear under different spellings.
  3. Outliers - A very small number of data points are implausible, but they match what is listed on the website, so we collected them as is. We recommend discarding any data points that are more than 4 standard deviations from the mean. Examples:
    1. Agent with ID “y36wprt” has 135 years of experience
    2. Some agents have averageDealValue = $1
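
The state and outlier issues above could be handled with something like the following. This is a minimal sketch, assuming rows are loaded as plain Python dicts keyed by column name. The helper names `normalize_state` and `drop_outliers` and the deliberately partial `STATE_NAMES` lookup are illustrative assumptions, not part of the dataset or its pipeline. The 4-standard-deviation threshold is the one recommended above.

```python
import statistics

# Hypothetical, partial lookup from full state names to two-letter codes.
# A real cleanup would need all states and territories.
STATE_NAMES = {"california": "CA", "texas": "TX", "new york": "NY"}


def normalize_state(raw):
    """Return a two-letter uppercase code when one can be derived.

    Values that cannot be mapped are returned stripped but otherwise unchanged.
    """
    if raw is None:
        return None
    value = raw.strip()
    if len(value) == 2 and value.isalpha():
        return value.upper()
    return STATE_NAMES.get(value.lower(), value)


def drop_outliers(rows, column, max_sigma=4.0):
    """Keep rows whose `column` value lies within max_sigma standard
    deviations of the mean. Rows where the value is None are kept,
    since the numeric columns are NULLABLE.
    """
    values = [r[column] for r in rows if r.get(column) is not None]
    if len(values) < 2:
        return list(rows)
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)
    if stdev == 0:
        return list(rows)
    return [
        r for r in rows
        if r.get(column) is None or abs(r[column] - mean) <= max_sigma * stdev
    ]
```

For example, `drop_outliers(rows, "yearsExperience")` would be applied once per numeric column of interest. The mean and standard deviation are computed with the outliers still included, so on very small extracts no value can fall 4 standard deviations from the mean and the filter has no effect.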