Quick Start Guide

This quick start guide will help you get up and running with GatherHub in just a few minutes. Follow these steps to start archiving content from various online sources.

Note: This guide assumes you have already completed the installation process.

Starting GatherHub

GatherHub can run in several modes depending on your needs:

Mode	Command	Description
Web Interface	`./gatherhub --web`	Starts the web interface for managing downloads through your browser
API Server	`./gatherhub --api`	Starts the API server for programmatic access
Background Service	`./gatherhub --daemon`	Runs in the background to automatically process downloads
All-in-one	`./gatherhub --web --api --daemon`	Starts all components (recommended for most users)
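
The flags in the table can be combined on one command line. As a minimal sketch, the helper below maps a mode name to the flags listed above; `gatherhub_flags` and its mode names are hypothetical, chosen for this example, and are not part of GatherHub.

```shell
#!/bin/sh
# Hypothetical helper: map a mode name to the flags from the table above.
gatherhub_flags() {
    case "$1" in
        web)    echo "--web" ;;
        api)    echo "--api" ;;
        daemon) echo "--daemon" ;;
        all)    echo "--web --api --daemon" ;;
        *)      echo "unknown mode: $1" >&2; return 1 ;;
    esac
}

# Print the all-in-one command instead of running it, so the sketch
# also works on a machine where GatherHub is not installed.
echo "./gatherhub $(gatherhub_flags all)"
```

Printing the command rather than executing it keeps the example safe to try; replace the `echo` with the command itself once you are happy with the flags.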

First Time Setup

When running GatherHub for the first time, it's recommended to access the web interface to configure settings:

Start GatherHub with the web interface: `./gatherhub --web`
Open a web browser and navigate to http://localhost:8060
If prompted, log in with the default credentials (usually admin/admin)
Follow the setup wizard if available, or proceed to the Settings page
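
If you start GatherHub from a script, it can help to wait until the web interface answers before opening the browser. The following is a rough sketch that assumes `curl` is installed and that the interface listens on http://localhost:8060 as described above; `wait_for_web` is a hypothetical helper name, not a GatherHub command.

```shell
#!/bin/sh
# Poll a URL until it answers or the attempts run out.
# Usage: wait_for_web URL [ATTEMPTS] [DELAY_SECONDS]
wait_for_web() {
    url=$1
    attempts=${2:-30}
    delay=${3:-1}
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # -s hides progress output, -o discards the response body,
        # --max-time bounds each probe. Any HTTP response counts as "up".
        if curl -s -o /dev/null --max-time 2 "$url"; then
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    return 1
}

# Example (commented out so the sketch does not start anything):
#   ./gatherhub --web &
#   wait_for_web http://localhost:8060 && echo "Web interface is up"
```

The function returns a non-zero status if the interface never responds, so a calling script can report the failure instead of hanging.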

Adding Content Sources

GatherHub can import content from various sources:

Browser Bookmarks

Go to Settings > Sources
Click "Add Source"
Select "Firefox Bookmarks" or another browser type
Specify the path to your browser's profile folder or bookmark database (if no path is given, GatherHub will try to autodetect it)
Save the configuration
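
If autodetection does not find your bookmarks, you can locate the database yourself and enter its path. The sketch below assumes Firefox on Linux, where profiles normally live under `~/.mozilla/firefox` and bookmarks are stored in a file named `places.sqlite`. The function name `find_firefox_places` is made up for this example, and it does not reflect how GatherHub itself autodetects profiles.

```shell
#!/bin/sh
# List candidate Firefox bookmark databases under a profile root.
# Usage: find_firefox_places [PROFILE_ROOT]
find_firefox_places() {
    root=${1:-"$HOME/.mozilla/firefox"}
    [ -d "$root" ] || return 1
    # Each profile is a subdirectory holding its own places.sqlite.
    find "$root" -maxdepth 2 -type f -name places.sqlite
}

find_firefox_places || echo "No Firefox profile directory found" >&2
```

On other operating systems the profile root differs, so pass the correct directory as the first argument.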

Manual URL Addition

Navigate to the Jobs page
Click "Add New Job"
Click "Upload Files" or "Add URLs"
Select one or more files (for Upload Files) or enter one or more URLs (for Add URLs)
Optional: Select the appropriate media type (HTML, Videos, Images, etc.)
Click "Add"

Scanning for Content

If the daemon is running, it automatically scans for new content at the frequency specified in the settings. Otherwise, to scan your configured sources manually:

Go to the Dashboard
Click "Scan Sources" in the Quick Actions section
Review the newly found content on the Jobs page

Starting Downloads

If the daemon is running, it automatically downloads jobs at the frequency specified in the settings. Otherwise, to start processing downloads manually:

Go to the Dashboard
Click "Process Downloads" in the Quick Actions section
Alternatively, if running in daemon mode, downloads will be processed automatically according to your scheduling settings

Monitoring Progress

You can monitor download progress in several ways:

Dashboard: Shows an overview of all jobs by status
Jobs Page: Provides detailed information and filtering options
Logs: Check app.log and activity.log for detailed operation information
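
The log files can also be inspected from a terminal with standard tools. The sketch below assumes `app.log` and `activity.log` are plain-text files in the current directory and that failures contain the word "error". This guide specifies neither the log location nor the log format, so adjust both to your installation. `count_log_errors` is a hypothetical helper name.

```shell
#!/bin/sh
# Count lines mentioning "error" (case-insensitive) in each given log file.
# Usage: count_log_errors FILE...
count_log_errors() {
    for f in "$@"; do
        if [ -r "$f" ]; then
            # grep -c prints the number of matching lines; it exits 1 when
            # there are no matches, so "|| true" keeps the loop going.
            n=$(grep -ci 'error' "$f" || true)
            echo "$f: $n"
        else
            echo "$f: not readable" >&2
        fi
    done
}

count_log_errors app.log activity.log
```

To watch both files live instead, `tail -f app.log activity.log` follows them as new lines are written.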

Special Media Types

Some media types require additional configuration:

YouTube Videos

To download YouTube videos (especially age-restricted or private content), you need to configure cookies:

Go to Settings > Cookie Settings
Follow the guide to export YouTube cookies from your browser
Upload the cookies.txt file
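
Before uploading, you may want to confirm that the exported file actually contains YouTube cookies. The sketch below assumes the file uses the Netscape cookies.txt format that browser export extensions commonly produce (seven tab-separated fields per cookie, with the domain first). This guide does not state which format GatherHub expects, so treat that as an assumption. `count_youtube_cookies` is a hypothetical helper name.

```shell
#!/bin/sh
# Count cookie entries for youtube.com in a Netscape-format cookies.txt.
# Usage: count_youtube_cookies FILE
count_youtube_cookies() {
    # Cookie lines have 7 tab-separated fields; field 1 is the domain.
    # Comment lines such as the file header have fewer fields and are skipped.
    awk -F '\t' 'NF == 7 && $1 ~ /youtube\.com$/ { n++ } END { print n + 0 }' "$1"
}

# Example (commented out because cookies.txt may not exist yet):
#   count_youtube_cookies cookies.txt
```

A result of 0 suggests the export was taken from the wrong site or while logged out.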

Git Repositories

Git repositories are cloned and also made available as ZIP archives (via event hook) for download through the web interface. No special configuration is needed for public repositories.

Web Archives

The Web Archive media type is never detected automatically; you must select it manually. Media types cannot be mixed within a single submission: if you enter more than one URL with Web Archive selected, every URL is crawled and archived.

Next Steps

After getting started, consider exploring these features:

Event Hooks - For customizing the download workflow
Tagging System - For organizing your content
Scheduling - For automating regular downloads
API Access - For integrating with other systems