Data clustering - Classification Problem

Certified Senior Developer

I have a CDT list with some kinds of information.

All items in this CDT list have a start date and an end date. I had to find the period overlaps, but I have already solved that problem. My requirement now is a different one: after finding all the periods that overlap, e.g. indexes 2, 3, 4, 5, 6 of an array of 7 elements ordered by date.

Looking at the data, I know that the intervals that overlap are 2 with 3 and 4, and 5 with 6.

My problem is to find those subgroups:

2-3-4 and 5-6.

  • Certified Senior Developer

    [ start_date:  01/01/2020   -   end_date: 31/01/2020 ]

    [ start_date:  01/03/2020   -   end_date: 31/03/2020 ]

    [ start_date:  29/03/2020   -   end_date: 02/04/2020 ]

    [ start_date:  01/04/2020   -   end_date: 30/04/2020 ]

    [ start_date:  01/05/2020   -   end_date: 31/05/2020 ]

    [ start_date:  18/05/2020   -   end_date: 31/05/2020 ]

    [ start_date:  01/06/2020   -   end_date: 30/06/2020 ]

    Now I have found that the array indexes that have overlaps are 2, 3, 4, 5, 6,

    but I'm not able to find that 2 overlaps with 3 and 4, and that 5 overlaps with 6.
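
    (To be precise, by "overlaps" I mean the usual condition that each period starts on or before the other one ends. A minimal sketch of that check for two rows, with local!a and local!b as placeholder variables:)

    and(
      local!a.start_date <= local!b.end_date,
      local!b.start_date <= local!a.end_date
    )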

  • This may get you closer: this expression returns a list of maps containing the overlaps. The current version will list both directions, for instance that 2 overlaps with 3 and that 3 overlaps with 2:

    a!localVariables(
      local!data: {
        a!map(start_date: todate("01/01/2020"), end_date: todate("01/31/2020")),
        a!map(start_date: todate("03/01/2020"), end_date: todate("03/31/2020")),
        a!map(start_date: todate("03/29/2020"), end_date: todate("04/02/2020")),
        a!map(start_date: todate("04/01/2020"), end_date: todate("04/30/2020")),
        a!map(start_date: todate("05/01/2020"), end_date: todate("05/31/2020")),
        a!map(start_date: todate("05/18/2020"), end_date: todate("05/31/2020")),
        a!map(start_date: todate("06/01/2020"), end_date: todate("06/33/2020"))
      },
      
      a!forEach(
        items: local!data,
        expression: {
          a!localVariables(
            local!index: fv!index,
            local!row: local!data[local!index],
            a!flatten(
              fn!reject(
                fn!isnull,
                a!forEach(
                  items: local!data,
                  expression: {
                    if(
                      fv!index=local!index,
                      null, /* do not review row against itself */
                      if(
                        or(
                          and( /* start date is within start/end for another row */
                            local!row.start_date>=fv!item.start_date,
                            local!row.start_date<=fv!item.end_date
                          ),
                          and( /* end date is within start/end for another row */
                            local!row.end_date>=fv!item.start_date,
                            local!row.end_date<=fv!item.end_date
                          ),
                          and( /* row starts before and ends after evaluating row */
                            local!row.start_date<fv!item.start_date,
                            local!row.end_date>fv!item.end_date
                          )
                        ),
                        a!map( /* return an overlap */
                          row: local!index,
                          overlapsWith: fv!index
                        ),
                        null /* no overlap */
                      )
                    )
                  }
                )
              )
            )
          )
        }
      )
    )

    We could refine this further if you can define exactly what you would like to see as an output. E.g., how are the "sub groups" to be found? Any chain of overlaps, essentially?
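
    For instance, if you only want each pair reported once, one small tweak should do it: skip earlier rows as well as the row itself, so each pair is only recorded from its lower index (a sketch of just the changed condition, not tested):

    if(
      fv!index <= local!index,
      null, /* skip the row itself and rows already compared */
      ... /* same overlap check as above */
    )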

  • Certified Senior Developer
    in reply to Chris

    Thank you so much for your answer. In the meantime I have managed to find all the "who overlaps with whom" pairs; the problem is that, with that information, I have to find all the data subgroups. So, for example, if index 3 in the map overlaps with 2 and with 4, I have to find a structure that tells me that 2, 3, 4 overlap together as one group, and 5, 6 together as another group.

  • Certified Lead Developer
    in reply to carminem0002

    Took me a while, but here you go. The trick is to use the reduce function, which allows you to keep a "shared memory" and modify it during the loop iterations. Each iteration checks one item for overlaps with the other items. If there is an overlap, I calculate a unique identifier and check whether it already exists in the list of overlaps. If not, I add it to the list; otherwise I just increase the size.

    There is a related use case for calculating the IBAN checksum that you can find in the forum as well.

    It requires two expressions:

    a!localVariables(
      local!data: {
        a!map(start_date: todate("01/01/2020"), end_date: todate("01/31/2020")),
        a!map(start_date: todate("03/01/2020"), end_date: todate("03/31/2020")),
        a!map(start_date: todate("03/29/2020"), end_date: todate("04/02/2020")),
        a!map(start_date: todate("04/01/2020"), end_date: todate("04/30/2020")),
        a!map(start_date: todate("05/01/2020"), end_date: todate("05/31/2020")),
        a!map(start_date: todate("05/18/2020"), end_date: todate("05/31/2020")),
        a!map(start_date: todate("06/01/2020"), end_date: todate("06/33/2020"))
      },
      reduce(
        rule!SSH_GroupOverlapsHelper(
          shared:_,
          item:_,
          data:_
        ),
        {},
        local!data,
        local!data
      )
    )

    Rule inputs: shared(any), item(any), data(any):

    a!localVariables(
      /* one boolean per row: true when ri!item overlaps that row (a row always overlaps itself) */
      local!overlaps: a!forEach(
        items: ri!data,
        expression: and(
          ri!item.start_date <= fv!item.end_date,
          ri!item.end_date >= fv!item.start_date
        )
      ),
      /* unique identifier built from the overlapping row indexes, e.g. "2-3-4" */
      local!groupId: joinarray(where(local!overlaps), "-"),
      if(
        sum(local!overlaps) > 1,
        /* We have > 1 overlap */
        if(
          contains(touniformstring(index(ri!shared, "id", {})), local!groupId),
          /* Existing group */
          a!localVariables(
            local!index: lookup(ri!shared.id, local!groupId, 0),
            a!update(
              ri!shared,
              local!index,
              a!update(
                ri!shared[local!index],
                "size",
                ri!shared[local!index].size + 1
              )
            )
          ),
          /* New group */
          append(
            ri!shared,
            a!map(
              id: local!groupId,
              ovs: ri!data[where(local!overlaps)],
              size: 1
            )
          )
        ),
        /* An item always overlaps with itself, so ignore that case */
        ri!shared
      )
    )
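
    If I trace the sample data through this by hand, the output should look roughly like the following (hand-evaluated, so treat it as an illustration rather than a guaranteed result):

    {
      a!map(id: "2-3",   ovs: { /* rows 2 and 3 */ },    size: 1),
      a!map(id: "2-3-4", ovs: { /* rows 2, 3 and 4 */ }, size: 1),
      a!map(id: "3-4",   ovs: { /* rows 3 and 4 */ },    size: 1),
      a!map(id: "5-6",   ovs: { /* rows 5 and 6 */ },    size: 2)
    }

    Chained overlaps therefore still show up as several partial groups, while 5-6 is simply counted twice.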

  • Certified Senior Developer
    in reply to Stefan Helzle

    Wow, this is a precious contribution, and it pushes me to study the reduce function in more depth.

    But, in a more convoluted way, I had already done this. The problem is when we have, for example:

    if

    1-2-3 overlap each other

    and then

    5-6

    with my solution, and in yours, we get 5 with 6 two times, but removing the duplication is not a problem (we can just use a union).

    The real problem is that I want to get 1-2-3 only once, and not

    1-2, 2-3 and then 1-2-3

    So if, because of 2, 1 and 3 all overlap each other, I want to find 1-2-3 only once, and not 1-2, 2-3 and then 1-2-3.

    Obviously I also want to get 5-6 and so on, with all the possible combinations.

    So, a single entry for the maximal overlap group of every single subgroup.

    I'm scared I might end up having to apply mathematical group theory :\

  • Certified Lead Developer
    in reply to carminem0002

    I figured this solution wasn't quite what you were looking for. But you can now modify it to exactly match your use case. Adapt the logic in the helper expression directly.
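
    For example, one way to get only the maximal groups would be a second reduce pass over the per-item groups that merges any groups sharing an index. A rough sketch, not tested; the rule name SSH_GroupMergeHelper and the a!map shape with a "members" field are just placeholders:

    a!localVariables(
      /* the index groups found in the first pass, e.g. */
      local!overlapGroups: {
        a!map(members: {2, 3}),
        a!map(members: {2, 3, 4}),
        a!map(members: {3, 4}),
        a!map(members: {5, 6})
      },
      reduce(
        rule!SSH_GroupMergeHelper(
          shared: _,
          item: _
        ),
        {},
        local!overlapGroups
      )
    )

    And the helper, with rule inputs shared (any) and item (any):

    a!localVariables(
      /* positions of the groups collected so far that share at least one index with the new group */
      local!touching: where(
        a!forEach(
          items: ri!shared,
          expression: length(intersection(fv!item.members, ri!item.members)) > 0
        )
      ),
      if(
        length(local!touching) = 0,
        /* no common index: keep it as a new group */
        append(ri!shared, ri!item),
        /* otherwise: replace all touched groups with their union plus the new members */
        append(
          index(ri!shared, difference(enumerate(length(ri!shared)) + 1, local!touching), {}),
          a!map(
            members: union(
              a!flatten(
                a!forEach(
                  items: index(ri!shared, local!touching, {}),
                  expression: fv!item.members
                )
              ),
              ri!item.members
            )
          )
        )
      )
    )

    For the sample data this should end up with two maps, members {2, 3, 4} and {5, 6}, each listed only once.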

  • Certified Senior Developer
    in reply to Stefan Helzle

    Yes, obviously it's a very good start, and it's a smarter solution than mine.

    So thank you so much for sharing this solution with me.