Skip to contents

Get metadata about available partitions in the cricsheet release. Useful for deciding which partition files to download.

Usage

get_remote_partition_info(
  repo = "peteowen1/bouncerdata",
  tag = "cricsheet",
  force = FALSE
)

Arguments

repo

Character. GitHub repository. Default "peteowen1/bouncerdata".

tag

Character. Release tag. Default "cricsheet".

force

Logical. Force re-download even if cached. Default FALSE.

Value

Data frame with columns:

  • partition: Partition key (e.g., "T20_male")

  • match_count: Number of matches in partition

  • file_size_bytes: Size of parquet file

  • file_size_mb: Size in MB

Examples

if (FALSE) { # \dontrun{
partitions <- get_remote_partition_info()
partitions[order(-partitions$file_size_mb), ]  # Largest first
} # }