Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Maximum matching algorithm in experimental #254

Open
wants to merge 71 commits into
base: v1.2
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
71 commits
Select commit Hold shift + click to select a range
3a9d316
Add initial commit of maximum matching
kchristin22 Jun 5, 2024
2906e70
Fix initialization of frontierC and add test for debugging purposes
kchristin22 Jun 6, 2024
e2bb41a
Add test of semiring in MCM
kchristin22 Jun 7, 2024
d8b48fa
Discard visited rows and Set parents of newly visited rows
kchristin22 Jun 7, 2024
9f49b4c
Compute mateR through inverting mateC
kchristin22 Jun 8, 2024
ee80f27
Update pathC
kchristin22 Jun 10, 2024
efbc6ea
Fix vector_build usage
kchristin22 Jun 12, 2024
402f381
Draft: Add PRUNE function
kchristin22 Jun 12, 2024
23d648b
Finish PRUNE function
kchristin22 Jun 13, 2024
8c761f7
Improve PRUNE, finish inner loop body and clean up
kchristin22 Jun 14, 2024
5112c8d
Implement comments
kchristin22 Jun 20, 2024
d28e141
Draft: fix previous build warning, add augment and test multiple iter…
kchristin22 Jun 20, 2024
18f23b2
Use mateR copy for each iteration and check if vectors are empty befo…
kchristin22 Jun 21, 2024
80a08e4
Improve descriptor in ur assignment and add explanation in a comment
kchristin22 Jun 21, 2024
582a59c
Fix how parents of fR are updated
kchristin22 Jun 24, 2024
6671d1e
Delete unnecessary allocations and wrong deallocations to avoid memor…
kchristin22 Jun 24, 2024
c9b5670
Use jumble flag of pack when inverting and allow indices to be unsort…
kchristin22 Jun 24, 2024
42479d7
Fix LAGraph_Free calls
kchristin22 Jun 24, 2024
610304f
Make I a bool vector, add another vector to copy to output and free w…
kchristin22 Jun 25, 2024
2c8c11c
Add the burble and evaluation of the matching in MCM test
kchristin22 Jun 25, 2024
f872354
Fix shared jumbled var among vectors bug
kchristin22 Jun 26, 2024
f3a5c0b
Test with a matrix from the data set of LAGraph
kchristin22 Jun 26, 2024
4032a53
Add second test and comment out any debugging prints
kchristin22 Jun 27, 2024
bddfc36
Specify nvals name per vector
kchristin22 Jun 27, 2024
eaf9d0e
Parametrize tests of multiple data sets
kchristin22 Jun 27, 2024
52fce12
Change input graph to matrix to support rectangular adjacency matrices
kchristin22 Jun 27, 2024
4091ddb
Change input graph of MaximumMatching to matrix
kchristin22 Jun 27, 2024
31c67d1
Use ur vector to compute mateR instead of inverting mateC
kchristin22 Jun 28, 2024
325e9bc
Fix and improve the algorithm
kchristin22 Jul 1, 2024
3b1b6a6
Merge remote-tracking branch 'upstream/v1.2' into maximum-matching
kchristin22 Jul 3, 2024
0faf31a
Revert vscode changes
kchristin22 Jul 3, 2024
ccd3a56
Merge apply and assign operations for the parentsR update
kchristin22 Jul 3, 2024
3239a8f
Change update operations to use a mask instead of an extension accumu…
kchristin22 Jul 3, 2024
f7f5f75
Use extracted jumbled var for mateC pack operation
kchristin22 Jul 3, 2024
0752b16
Add test cases where JIT is off and the mateC is not empty at the beg…
kchristin22 Jul 5, 2024
f8ee5d3
burble
DrTimothyAldenDavis Jul 5, 2024
c5bd7bd
July 5th code session
DrTimothyAldenDavis Jul 5, 2024
32152a5
Work on FIXME comments
kchristin22 Jul 8, 2024
180f964
Finish helper invert functions and polish them
kchristin22 Jul 8, 2024
5a09563
Specify the steps of the algorithm as indicated in the paper
kchristin22 Jul 8, 2024
b691eac
Initial benchmark for MCM
kchristin22 Jul 9, 2024
dea5618
Add push-pull optimization and enable it in tests and benchmarks
kchristin22 Jul 11, 2024
cedcc81
comments and timing
DrTimothyAldenDavis Jul 11, 2024
8b086f7
Fix mateC initialization on the test
kchristin22 Jul 11, 2024
bd45f2e
starting to add A,AT options
DrTimothyAldenDavis Jul 11, 2024
6a40f82
.
DrTimothyAldenDavis Jul 11, 2024
141df7b
add check of input/output of invert
DrTimothyAldenDavis Jul 11, 2024
1f18d60
Fix setting of mateR and inverting of vr in the augment function
kchristin22 Jul 11, 2024
96839fa
debugging for Orkut matrix, added check_matching
DrTimothyAldenDavis Jul 12, 2024
ce5ead5
Print number of matches in the benchmark
kchristin22 Jul 12, 2024
e41787e
Merge remote-tracking branch 'refs/remotes/origin/maximum-matching' i…
kchristin22 Jul 12, 2024
d63aa7e
Clear frontierC before updating it and clean up
kchristin22 Jul 16, 2024
dd62fd4
Add check of the matching in benchmark and fix memory leakage in tests
kchristin22 Jul 16, 2024
9aed507
Revert mateR update method and use one trial for the benchmarks
kchristin22 Jul 16, 2024
df0c3e0
Apply mask separately in vxm
kchristin22 Jul 17, 2024
138d4bc
Use 3 trails in benchamrking for a better sample than having just one
kchristin22 Jul 17, 2024
7373fb4
Fix push-only method
kchristin22 Jul 23, 2024
6b593a9
Allow mateR as initial matching and add mateR as output
kchristin22 Jul 23, 2024
7788238
First half of vanilla version: missing invert2
kchristin22 Jul 24, 2024
edb3df9
Merge vxm and mask operations again for testing
kchristin22 Jul 27, 2024
a06981c
Fix user defined functions
kchristin22 Oct 4, 2024
be8f155
Fix vanilla version
kchristin22 Nov 11, 2024
65c6f1a
Add brief paper description and lose debug comments
kchristin22 Nov 11, 2024
ee1ba8f
Rename to advanced method
kchristin22 Nov 11, 2024
85d8236
Turn swap of pathC and pathCopy into an assign op
kchristin22 Nov 16, 2024
6a84ab8
Split single vxm and mask assign op in two
kchristin22 Nov 16, 2024
04a0880
Flush after print in mcm_demo
kchristin22 Nov 16, 2024
9cc7573
Use 16 trials for mcm benchmark
kchristin22 Nov 18, 2024
d5bbc78
Use swap instead of assign and fix format of files
kchristin22 Nov 18, 2024
3ea156f
Add doc and poster for Maximum Matching
kchristin22 Nov 18, 2024
3df78b9
Remove forgotten parentsR mask in vxm
kchristin22 Nov 19, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
First half of vanilla version: missing invert2
kchristin22 committed Jul 24, 2024
commit 77882380270b031d3f11be0d404db8d1cff34fd6
108 changes: 95 additions & 13 deletions experimental/algorithm/LAGraph_MaximumMatching.c
Original file line number Diff line number Diff line change
@@ -215,12 +215,21 @@ static inline GrB_Info invert_nondestructive(
ASSERT(in != out);

// All input/output vectors must be of type GrB_UINT64.

#if LAGRAPH_SUITESPARSE
GRB_TRY(
GxB_Vector_unpack_CSC(in, (GrB_Index **)&I, (void **)&X1, &IBytes,
&XBytes, NULL, &nvals, &jumbled,
NULL)); // the output and input should have no
// duplicates, so the order doesn't matter
#else
GRB_TRY(GrB_Vector_nvals(&nvals, in));
LG_TRY(LAGraph_Malloc((void **)&I, nvals, sizeof(GrB_Index), msg));
LG_TRY(LAGraph_Malloc((void **)&X1, nvals, sizeof(GrB_Index), msg));
GRB_TRY(GrB_Vector_extractTuples_UINT64(
I, X1, &nvals, in)); // the output and input should have no
// duplicates, so the order doesn't matter
#endif

GRB_TRY(GrB_Vector_clear(out)); // clear the output first as a prerequisite
// of the build method
GRB_TRY(GrB_Vector_build_UINT64(
@@ -230,8 +239,10 @@ static inline GrB_Info invert_nondestructive(
// again to the input
// the input should have no duplicates in the
// values list, so dups are not handled
#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_Vector_pack_CSC(in, (GrB_Index **)&I, (void **)&X1, IBytes,
XBytes, NULL, nvals, jumbled, NULL));
#endif
}

static inline GrB_Info
@@ -254,18 +265,32 @@ invert(GrB_Vector out, // input/output. Same as invert_nondescructive above.
GrB_Index IBytes = 0, XBytes = 0;
uint64_t nvals = 0;

// #if LAGRAPH_SUITESPARSE
#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_Vector_unpack_CSC(in, (GrB_Index **)&I, (void **)&X1, &IBytes,
&XBytes, NULL, &nvals, &jumbled,
NULL)); // #else
&XBytes, NULL, &nvals, &jumbled, NULL));
#else
// vanilla case using extractTuples and build:
// allocate I and X GrB_extractTuples(I, X, in, ...) GrB_build(out, X, I,
// ...) free I and X: LG_FREE_ALL;
// #endif
// allocate I and X for GrB_extractTuples
GRB_TRY(GrB_Vector_nvals(&nvals, in));
LG_TRY(LAGraph_Malloc((void **)&I, nvals, sizeof(GrB_Index), msg));
LG_TRY(LAGraph_Malloc((void **)&X1, nvals, sizeof(GrB_Index), msg));
GRB_TRY(GrB_Vector_extractTuples_UINT64(
I, X1, &nvals, in)); // the output and input should have no
// duplicates, so the order doesn't matter
GRB_TRY(GrB_Vector_clear(in));
#endif
if (!dups)
{
#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_Vector_pack_CSC(out, (GrB_Index **)&X1, (void **)&I, XBytes,
IBytes, NULL, nvals, true, NULL));
#else
GRB_TRY(GrB_Vector_clear(out));
// GrB_MIN_UINT64 is used instead of first because first is an extension
GRB_TRY(GrB_Vector_build_UINT64(out, X1, I, nvals, GrB_MIN_UINT64));
// build copies the lists so they need to be freed in LG_FREE_ALL
LG_FREE_ALL;
#endif
}
else
{
@@ -458,41 +483,75 @@ int LAGraph_MaximumMatching(

GRB_TRY(GrB_Vector_new(&parentsR, GrB_UINT64, nrows));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_Type_new(&Vertex, sizeof(vertex), "vertex", VERTEX_DEFN));
#else
GRB_TRY(GrB_Type_new(&Vertex, sizeof(vertex)));
#endif

GRB_TRY(GrB_Vector_new(&frontierC, Vertex, ncols));

GRB_TRY(GrB_Vector_new(&frontierR, Vertex, nrows));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_IndexUnaryOp_new(&initFrontierOp, (void *)initFrontier, Vertex,
GrB_BOOL, GrB_BOOL, "initFrontier",
INIT_FRONTIER_DEFN));
#else
GRB_TRY(GrB_IndexUnaryOp_new(&initFrontierOp, (void *)initFrontier, Vertex,
GrB_BOOL, GrB_BOOL));
#endif

GRB_TRY(GrB_Vector_new(&I, GrB_BOOL, ncols));
GRB_TRY(GrB_Vector_assign_BOOL(I, NULL, NULL, 1, GrB_ALL, ncols, NULL));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_BinaryOp_new(&MinParent, (void *)minparent, Vertex, Vertex,
Vertex, "minparent", MIN_PARENT_DEFN));
#else
GRB_TRY(GrB_BinaryOp_new(&MinParent, (void *)minparent, Vertex, Vertex,
Vertex));
#endif
vertex infinityParent = {GrB_INDEX_MAX + 1, 0};
GRB_TRY(GrB_Monoid_new_UDT(&MinParent_Monoid, MinParent, &infinityParent));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_BinaryOp_new(&Select2ndOp, (void *)select2nd, Vertex, GrB_BOOL,
Vertex, "select2nd", SELECT_2ND_DEFN));
#else
GRB_TRY(GrB_BinaryOp_new(&Select2ndOp, (void *)select2nd, Vertex, GrB_BOOL,
Vertex));
#endif

GRB_TRY(GrB_Semiring_new(&MinParent_2nd_Semiring, MinParent_Monoid,
Select2ndOp));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_BinaryOp_new(&Select1stOp, (void *)select1st, Vertex, Vertex,
GrB_BOOL, "select1st", SELECT_1ST_DEFN));
#else
GRB_TRY(GrB_BinaryOp_new(&Select1stOp, (void *)select1st, Vertex, Vertex,
GrB_BOOL));
#endif

GRB_TRY(GrB_Semiring_new(&MinParent_1st_Semiring, MinParent_Monoid,
Select1stOp));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_UnaryOp_new(&getParentsOp, (void *)keepParents, GrB_UINT64,
Vertex, "keepParents", KEEP_PARENTS_DEFN));
#else
GRB_TRY(GrB_UnaryOp_new(&getParentsOp, (void *)keepParents, GrB_UINT64,
Vertex));
#endif

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_UnaryOp_new(&getRootsOp, (void *)keepRoots, GrB_UINT64, Vertex,
"keepRoots", KEEP_ROOTS_DEFN));
#else
GRB_TRY(
GrB_UnaryOp_new(&getRootsOp, (void *)keepRoots, GrB_UINT64, Vertex));
#endif

GRB_TRY(GrB_Vector_new(&parentsUpdate, GrB_UINT64, nrows));

@@ -508,17 +567,32 @@ int LAGraph_MaximumMatching(

GRB_TRY(GrB_Vector_new(&rootfRIndexes, GrB_UINT64, ncols));

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_IndexUnaryOp_new(&buildfCTuplesOp, (void *)buildfCTuples,
Vertex, GrB_UINT64, GrB_BOOL, "buildfCTuples",
BUILT_FC_TUPLES_DEFN));
#else
GRB_TRY(GrB_IndexUnaryOp_new(&buildfCTuplesOp, (void *)buildfCTuples,
Vertex, GrB_UINT64, GrB_BOOL));
#endif

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_UnaryOp_new(&vertexTypecastOp, (void *)vertexTypecast, Vertex,
GrB_UINT64, "vertexTypecast",
VERTEX_TYPECAST_DEFN));
#else
GRB_TRY(GrB_UnaryOp_new(&vertexTypecastOp, (void *)vertexTypecast, Vertex,
GrB_UINT64));
#endif

#if LAGRAPH_SUITESPARSE
GRB_TRY(GxB_BinaryOp_new(&setParentsMatesOp, (void *)setParentsMates,
Vertex, Vertex, Vertex, "setParentsMates",
SET_PARENTS_MATES_DEFN));
#else
GRB_TRY(GrB_BinaryOp_new(&setParentsMatesOp, (void *)setParentsMates,
Vertex, Vertex, Vertex));
#endif

GRB_TRY(GrB_Vector_new(&vr, GrB_UINT64, nrows));

@@ -709,13 +783,17 @@ int LAGraph_MaximumMatching(
stdout);
*/

#if LAGRAPH_SUITESPARSE
// keep mates and roots of the R frontier (ordered indices)
LAGRAPH_TRY(
invert_2(rootfRIndexes, currentMatesR, rootsfR, true,
msg)); // rootsfRIndexes(j) = i, where i
// is the col mate of the first
// row included in the current R
// frontier with a col root of j
// is the col mate of the first
// row included in the current R
// frontier with a col root of j
#else
LAGRAPH_TRY(invert(rootfRIndexes, rootsfR, true, msg));
#endif

// keep only col roots that are not included in ufR
GRB_TRY(GrB_Vector_assign(rootfRIndexes, pathUpdate, NULL,
@@ -726,9 +804,11 @@ int LAGraph_MaximumMatching(
// STEP 7a (ufrontierR not empty): Move values in the
// correct positions for the C frontier
//----------------------------------------------------------------------
// rootfRIndexes = invert (rootfRIndexes), so that
// rootfRIndexes(i) = j, where (i,j) = (parentC, rootC) of
// the new frontier C
// rootfRIndexes = invert (rootfRIndexes), so that:
// if LAGRAPH_SUITESPARSE: rootfRIndexes(i) = j,
// where (i,j) = (parentC, rootC) of the new frontier C
// else: rootfRIndexes(i) = j, where i is the matched child
// of the current R frontier that stems from root path
LAGRAPH_TRY(
invert(rootfRIndexes, rootfRIndexes, false, msg));
}
@@ -738,9 +818,11 @@ int LAGraph_MaximumMatching(
// rootC)
//----------------------------------------------------------------------
GRB_TRY(GrB_Vector_clear(frontierC));
#if LAGRAPH_SUITESPARSE
GRB_TRY(GrB_Vector_apply_IndexOp_UDT(frontierC, NULL, NULL,
buildfCTuplesOp,
rootfRIndexes, &y, NULL));
#endif

/* debug
GrB_Index C[ncols];