Greenhouse Connector Overview

Greenhouse is an applicant tracking system and recruiting software, is designed to help make companies great at hiring, improving the process for everyone.

The Datacoral Greenhouse connector collects data from the Greenhouse Harvest API and enables data flow into a data warehouse, such as Redshift or Snowflake.

Features & Capabilities

  • Backfill: Full historical sync of your entire data
  • Data Extraction Modes: snapshot, incremental with pagination
  • Data Load Modes: replace, append and merge
  • Tables and Columns selection: Ability to select tables and columns
  • Customizations: Update the configurations easily using the UI
  • Scheduling: Highly flexible scheduling system
  • Capture Deletes: Setup webhooks to capture deleted records in Greenhouse objects

Supported load units

The Greenhouse connector automatically collects the following loadunits from the Greenhouse API and makes them available in your warehouse for analysis.

application/applicationsCaptures all the attributes for applications
approvals/jobs/{id}/approval_flowsCapture all the attributes for approvals
candidates/candidatesCaptures all the first-level attributes for candidates.
demographic_answer_options/demographics/answer_optionsCapture all the attributes for demographic_answer_options
demographic_answers/demographics/answersCapture all the attributes for The demographic answers
demographic_question_sets/demographics/question_setsCapture all the attributes for demographic question sets
demographic_questions/demographics/questionsCapture all the attributes for demographic questions
departments/departmentsCapture all the attributes for departments
eeoc/eeocCapture all the attributes for eeoc
job_openings/jobs/{job_id}/openingsCapture all the attributes for job openings
job_posts/job_postsCapture all the attributes for job posts
job_stages/job_stagesCapture all the attributes for job stages
jobs/jobsCapture all the first-level attributes for jobs.
offers/offersCapture all the attributes for offers
offices/officesCapture all the attributes for offices
rejection_reasons/rejection_reasonsCapture all the attributes for rejection reasons
scheduled_interviews/scheduled_interviewsCapture all the attributes for scheduled_interviews
scorecards/scorecardsCapture all the attributes for scorecards
sources/sourcesCapture all the attributes for sources
tags/tags/candidateCapture all the attributes for tags
user_permissions/users/{id}/permissions/jobsCapture all the attributes for user_permissions
user_roles/user_rolesCapture all the attributes for user_roles
users/usersCapture all the attributes for users

Connector output

Output of this connector is stored in S3 and Redshift.

AWS S3 Data stored in AWS S3 is partitioned by date and time s3://customer_installation.datacoral/<connector-name>

Destination warehouse: Schema - schema name will be same as the connector name. Tables produced by the connector are:

- schema.application
- schema.approvals
- schema.candidates
- schema.demographic_answer_options
- schema.demographic_answers
- schema.demographic_question_sets
- schema.demographic_questions
- schema.departments
- schema.eeoc
- schema.job_openings
- schema.job_posts
- schema.job_stages
- schema.offers
- schema.offices
- schema.rejection_reasons
- schema.scheduled_interviews
- schema.scorecards
- schema.sources
- schema.tags
- schema.user_permissions
- schema.user_roles
- schema.users

Next Steps

Additional Information

Got a question?

Please contact Datacoral's Support Team, we'd be more than happy to answer any of your questions.