A better approach replicating 'Curves in Curves'

qythium · April 20, 2018, 5:47am

Yeah I think that’s the main insight here - I’m sure @PeterFotiadis is aware of this, but optimizing a O(n²) algorithm even by a factor of 1000% becomes totally insignificant when you start scaling up the inputs, because the quadratic term very very quickly swallows up any improvements made in linear time.

Using the examples above, even a fast check that works in 500 ms for 857 curves will take 7+ hours for a 200K curve dataset

Using boundary surfaces also appears to suffer from the same O(n²) factor.

Essentially any method using an RTree, however inefficient, will eventually outperform a method involving n² number of checks, because it scales approximately linearly with the size of the data.

I had a go at making the most basic / concise implementation I could (using Python of course )
curve_clusters.gh (113.2 KB)

from Rhino.Geometry import *
from Rhino.RhinoDoc import ActiveDoc as rhdoc
tol = rhdoc.ModelAbsoluteTolerance

ids = range(len(crvs))
# initialize with dictionary of sets containing its own key
# i.e all curves in separate clusters
clusters = {i: set([i]) for i in ids}

rtree = RTree()
for i, crv in enumerate(crvs):
    # populate the RTree with bounding boxes
    rtree.Insert(crv.GetBoundingBox(True), i)


def check_containment(crvA, crvB):
    # checks if either A contains B or B contains A
    # relatively expensive, so further optimizations can be made here eg. for polylines
    rg = Curve.PlanarClosedCurveRelationship(crvA, crvB, Plane.WorldXY, tol)
    return (rg in (RegionContainment.AInsideB, RegionContainment.BInsideA))


def callback(sender, e):
    a, b = e.Id, e.IdB
    if a == b:
        return
    if b in clusters[a] or a in clusters[b]:
        return
    if check_containment(crvs[a], crvs[b]):
        clusters[a].add(b)
        clusters[b].add(a)


RTree.SearchOverlaps(rtree, rtree, 0.01, callback) # O(n)


# iterate through the clusters from the highest to lowest index
for i in reversed(ids):
    # still O(n) despite nested for loop, because inner loop is ~O(1)
    for j in list(clusters[i]):  # create new list to avoid changing the iterable
        # merge with sets of a greater index
        if j > i:
            try:
                clusters[i].update(clusters.pop(j))
            except KeyError:  # j has already been popped
                pass

mapping = [0] * len(crvs)
for k, vals in clusters.items():
    for v in vals:
        mapping[v] = k

num_clusters = len(clusters)

qythium · April 20, 2018, 6:01am

Okay it looks like your C# is ~2x faster ( as expected )
I wonder if there’s a performance difference between using RTree.Search versus RTree.SearchOverlaps due to overhead from all the callbacks…

PeterFotiadis · April 20, 2018, 6:42am

Math.Pow(hmm,hmm) … that’s for the P thingy (what about the Truth Out There ?, he he)

This thread started as any other thread and heads towards other things I confess.

Must clear my mind from doing Panigale custom maps (a useless thing considering the ponies available as standard) and fucus to the Truth Out There (a WIP thread exploiting the truth out there).

Screen%20Shot%20104

AndersDeleuran · April 20, 2018, 7:25am

Great example there, bookmarked

RadovanG · April 20, 2018, 10:21am

Another improvement in special case when we know that curves do not intersect will be to check if curveB.PointAtStart is inside curveA.
Code inside tree.Search would look like:

  tree.Search(bboxA, (to, ta) =>
    {
    int j = ta.Id;
    if (i != ta.Id && !sampled[j])
    {
      Curve crvB = cList[j];
      Point3d ptStart = crvB.PointAtStart;
      var pointContainment = crvA.Contains(ptStart);
      if (pointContainment == PointContainment.Inside)
      {
        if (!sampled[i]) UpdateTrees(crvA, ref curveClustersLocal, ref connClustersLocal, ref sampled, i, branch, true);
        if (!sampled[j]) UpdateTrees(crvB, ref curveClustersLocal, ref connClustersLocal, ref sampled, j, branch, true);
      }
    }
    });

This will shorten time of execution almost to half.

PeterFotiadis · April 20, 2018, 12:29pm

It’s a question of trust the data (as always). But other than that I see that you are a man who walks the walk (as Gythium … if we forget that little detail, he he) > bravo again.

Clusters_CurveContainment_V2B.gh (341.3 KB)

BTW: It’s funny that even a primitive data validation check increases the Elapsed time about 10-15%

PeterFotiadis · April 21, 2018, 4:59am

You know what? In fact we don’t need a RTree at all. Just BBoxes and some canditate BBox questions (but that is what a RTree does anyway [pluts the fact that does N*N comparisons]). Rounding the BBox domains (user controlled option: say in 2 decimals etc helps as well).

See : Mesh Inclusion Error and the useBox magic option. Not 100% the same thing but close. In fact I was ready to use a trick like this … but then you did the RTree thingy and the rest are history.

More in V3 (and some // stuff) … provited that Valentino could deliver some decent results in the Texas MotoGP (for inspiration) . V4 in the mean time would answer the following critical question: given 1M plates full with Swiss cheese and 1Z rats circling around how we can cluster the clouds in the sky on a per rat color basis?

Screen%20Shot%20110

Michael_Pryor · April 23, 2018, 12:23am

This entire thread:

RadovanG · April 30, 2018, 3:43pm

I tried some boxMagic things and with help of mice sudenly I got some code that is permormig well.
Litle bit of some sortings, up-sortings, right-sorting… re-sortings and rememberings what I was just sorting … with assumption that curveA contains curveB only and only if bboxA contains bboxA what should be still valid in this universe. Or maybe not?

Clusters_CurveContainment_v42.gh (5.7 MB)

PeterFotiadis · April 30, 2018, 5:42pm

Indeed the magic this (and magic that) can cut the mustard rather well.

Me? Well … Valentino finished a miserable 4th in Texas (IM miles behind that man, even Iannone got a podium place) meaning that I have absolutely no will for C,D,E … Z# : In fact if I was a real man I should commit seppuku ASAP.

Antonio_Cersosimo · September 8, 2018, 6:10pm

Hi, I was wondering what would be the test condition for the case when the edge of an Inner-Curve overlaps with the edge of the Outer-Curve. Something like this:

In this case the result should give: A{1,3}, B{0,3,4}, C{0,2}.

RIL · September 8, 2018, 6:34pm

Interesting, especially A which seems to be a typical Minkowski Sum problem in which A can be numerically solved in almost no time (collision detection).

A simplified algorithm for the initial collision test is presented by Casey Muratori in the following talk (link below) where he addresses three topics, implicit surfaces, quaternions and finally, Minkowski Sums (collisions test) giving “hands on” examples for implementation strategies for the latter. Link directly to the Minkowski part:

Youtube link:

// Rolf

anikolo · September 9, 2018, 8:39am

Just saw this post, and I’ve seen it has evolved into an adult discussion about intersections and coding, so take this as a parenthesis:

IF it is a given that the curves don’t intersect, and IF you are not using coding, a -relatively- fast method is to check the inclusion of just a point on the curves.
In my system it took 3 secs to calculate for 1300 rectangles.
curveInCurve.gh (83.6 KB)

RIL · September 9, 2018, 12:11pm

Nice solution for world-aligned rectangles.

// Rolf

aprocellum · September 9, 2018, 1:12pm

If we’re going for pure performance… Here’s a shot using Impala to speed parts of it up. First finding the neighborhood of all nearby curves (curves with a centrepoint within the ‘radius’ of the largest point in the set) and then doing containment testing. All of this is parallel and comes out to just over a hundred milliseconds all told… it’s the boundary surfacing to preview at the end that takes the time

curveInCurve_2.gh (100.3 KB)

anikolo · September 9, 2018, 1:17pm

milliseconds? amazing! downloaded your code to analyze it immediately!
(also: I didn’t know about impala, is it that effective?)

RadovanG · September 12, 2018, 12:04am

Proper code will do it faster. On 4cores/8threads PC attached algorithm will do it in 6 milliseconds, the rest of time is spend for data reading/casting…
curveInCurve_Compare42.gh (86.6 KB)

taniamadah · June 30, 2020, 6:58pm

Hi

I am actually looking to find the biggest rectangle I can fit in an organic closed curve. I was wondering if you have any solution for this.

thank you

markz · July 1, 2020, 4:42am

See this discussion under grasshopper:

" Find the largest possible rectangle inside an irregular closed curve"

PeterFotiadis · July 1, 2020, 5:00am

I have a real-time C# that does that (in 2d/3d) but the practice is closed for the moment. In the mean time see the thread in the link above.

PS the approach is recursive and is “based” on Matrix classic problems like this:

Topic		Replies	Views
Curve.Contains Method too slow Rhino Developer v6 , v5	67	5623	February 18, 2022
Maximise non-overlapping curves from set of overlapping curves Grasshopper Developer	17	985	January 16, 2020
Boundary/Outline "Contest" GH/C# Grasshopper rhinocommon	39	3027	July 7, 2019
Find curves that are contained within another curve Scripting windows	17	201	January 9, 2026
Sort Curves Grasshopper	26	8341	March 1, 2018

A better approach replicating 'Curves in Curves'

Related topics