Joint work with Prof Dianne Cook, Dr Paul Harrison, Dr Michael Lydeamore, Dr Thiyanga S. Talagala
Lack of control over cluster shapes
cardinalR
collection of various high-dimensional data structures in R
Generation of geometric structures in arbitrary dimensions.
Control over background noise, clustering,
and sample positioning.
Generation of explainable, challenging synthetic datasets for
benchmarking high-dimensional methods.
Branching
Cone
Cube
Linear
Mobius
Polynomial
Pyramid
Scurve
Sphere
Swiss roll
Trefoil
Trigonometric
Implementation
Letโs combine !!!
Different shaped clusters (shape
) with different
sample sizes (n
)
location (loc
)
scale (scale
)
rotation (rotation
)
with or without background noise (is_bkg
)
# A tibble: 1,500 ร 5
x1 x2 x3 x4 cluster
<dbl> <dbl> <dbl> <dbl> <chr>
1 -0.109 -0.0144 0.108 -0.0820 cluster2
2 -0.450 -0.650 -0.122 0.0414 cluster1
3 -0.496 -0.411 0.230 0.0438 cluster1
4 -0.0147 0.0454 -0.0634 -0.0633 cluster2
5 0.370 -0.539 0.0263 0.0287 cluster1
6 -0.0122 -0.690 0.0270 0.0710 cluster1
# โน 1,494 more rows
loc_matrix <- matrix(
c(0, 0, 0, 0,
5, 9, 0, 0,
3, 4, 10, 7
), nrow = 3, byrow = TRUE)
multigau <- make_multigau(n = c(300, 200, 500), p = 4, k = 3, loc = loc_matrix, scale = c(0.2, 1.5, 0.5))
multigau
# A tibble: 1,000 ร 5
x1 x2 x3 x4 cluster
<dbl> <dbl> <dbl> <dbl> <chr>
1 2.89 4.13 10.1 7.15 cluster3
2 3.06 3.98 9.92 7.00 cluster3
3 0.0125 0.144 -0.0449 0.0189 cluster1
4 3.01 4.12 9.94 6.99 cluster3
5 5.46 9.39 -0.598 -0.380 cluster2
6 2.83 4.05 10.1 6.78 cluster3
# โน 994 more rows
positions <- geozoo::simplex(p=4)$points
positions <- positions * 0.8
## To generate data
five_clusts <- gen_multicluster(n = c(2250, 1500, 750, 1250, 1750),
p = 4, k = 5, loc = positions,
scale = c(0.4, 0.35, 0.3, 1, 0.3),
shape = c("helicalspiral",
"hemisphere", "unifcube",
"cone", "gaussian"),
rotation = NULL,
is_bkg = FALSE)
Data
Dimension reduction layouts
p
) and number of clusters (k
)loc
matrix"pyrstar"
, "conicspiral"
, "hemisphere"
, etc.is_bkg = TRUE
) to simulate real-world conditionscardinalR
empowers researchers To:Jayani P.G. Lakshika
Collaborators: Prof Dianne Cook, Dr Paul Harrison, Dr Michael Lydeamore, Dr Thiyanga S. Talagala