Skip to contents

Appends/overwrites a column (default: "exist") filled with 1L on the lazy data_long table. Preserves laziness; no collection is forced.

Usage

add_exist(phip_data, exist_col = "exist", overwrite = FALSE)

Arguments

phip_data

A <phip_data> object.

exist_col

Name of the existence column to append/overwrite.

overwrite

If FALSE and the column exists, abort with a phiperio-style error.

Value

Modified <phip_data> with updated data_long.

Examples

pd <- load_example_data()
#> [13:45:11] INFO  Constructing <phip_data> object
#>                  -> create_data()
#> [13:45:11] INFO  Fetching peptide metadata library via get_peptide_library()
#> [13:45:11] INFO  Retrieving peptide metadata into DuckDB cache
#>                  -> get_peptide_library(force_refresh = FALSE)
#> [13:45:11] INFO  Opened DuckDB connection
#>                    - cache dir:
#>                      /home/runner/.cache/R/phiperio/peptide_meta/phip_cache.duckdb
#>                    - table: peptide_meta
#> [13:45:11] INFO  Starting download
#>                    - dest:
#>                      /home/runner/.cache/R/phiperio/peptide_meta/combined_library_15.01.26.rds
#> [13:45:11] OK    Download succeeded (method = <getOption()>)
#> [13:45:12] OK    Checksum verified (SHA-256 match)
#> [13:45:14] OK    Download complete and loaded into R
#> [13:45:20] INFO  Importing sanitized metadata into DuckDB cache...
#> [13:45:21] OK    peptide_meta table created in DuckDB cache
#> [13:45:21] OK    Retrieving peptide metadata into DuckDB cache - done
#>                  -> elapsed: 10.214s
#> [13:45:21] OK    Peptide metadata acquired
#> [13:45:21] INFO  Validating <phip_data>
#>                  -> validate_phip_data()
#> [13:45:21] INFO  Checking structural requirements (shape & mandatory columns)
#> [13:45:21] INFO  Checking outcome family availability (exist / fold_change /
#>                  raw_counts)
#> [13:45:21] INFO  Checking collisions with reserved names
#>                    - subject_id, sample_id, timepoint, peptide_id, exist,
#>                      fold_change, counts_input, counts_hit
#> [13:45:21] INFO  Ensuring all columns are atomic (no list-cols)
#> [13:45:21] INFO  Checking key uniqueness
#> [13:45:21] INFO  Validating value ranges & types for outcomes
#> Warning: Missing values are always removed in SQL aggregation functions.
#> Use `na.rm = TRUE` to silence this warning
#> This warning is displayed once every 8 hours.
#> [13:45:21] INFO  Assessing sparsity (NA/zero prevalence vs threshold)
#>                    - warn threshold: 50%
#> [13:45:22] INFO  Checking peptide_id coverage against peptide_library
#> Warning: [13:45:22] WARN  peptide_id not found in peptide_library (e.g. 10003)
#>                  -> peptide library coverage.
#> [13:45:22] INFO  Checking full grid completeness (peptide * sample)
#> Warning: [13:45:22] WARN  Counts table is not a full peptide * sample grid.
#>                  -> grid completeness
#>                    - observed rows: 78200
#>                    - expected rows: 156000.
#> Warning: [13:45:22] WARN  Grid remains incomplete (auto_expand = FALSE).
#>                  -> grid completeness
#>                    - observed rows: 78200
#>                    - expected rows: 156000.
#> [13:45:22] OK    Validating <phip_data> - done
#>                  -> elapsed: 0.667s
#> [13:45:22] OK    Constructing <phip_data> object - done
#>                  -> elapsed: 10.884s
pd <- add_exist(pd, overwrite = TRUE) # overwrites if present
#> [13:45:22] INFO  Ensuring existence flag on data_long
#>                  -> column: 'exist'; overwrite: TRUE
#> Warning: [13:45:22] WARN  Overwriting existing existence flag.
#>                  -> adding existence indicator
#>                    - column: "exist".
#> [13:45:22] OK    Ensuring existence flag on data_long - done
#>                  -> elapsed: 0.009s