[ot][spam][crazy] in place sort of binary strings ie lines in a file - cypherpunks - lists.cpunks.org

newer
[pt][crazy][crazy] smidgy thoughts...

[ot][spam][crazy] in place sort of binary strings ie lines in a file

older
Calls for informal alliance with ’...

Undescribed Horrific Abuse, One Victim & Survivor of Many

19 Jan 2023 19 Jan '23

9:14 p.m.

make simple enough, then do. i have a strong inhibition in the middle of it, and it’s not complicated, so it’s interesting, since inhibitions are so frustrating and it seems far enough from possible reasons to be theoretically separable

Reply

Sign in to reply online Use email software

Show replies by date

Undescribed Horrific Abuse, One Victim & Survivor of Many

19 Jan 19 Jan

9:15 p.m.

ok um offsets = list of all linebreaks then we form list of all byte ranges from the linebreaks

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:17 p.m.

then sort list of byte ranges. now we have a list of starting byte ranges, and a list of the order they go in. now make list of output byte ranges. then cut the lists so that there is no overlap at the edges. the byte ranges turn into sequences. then we can swap the sequences one at a time to reorder the data in place. simple approach. problem remains: cutting them needs to facilitate swapping. need to be clear on what a useful length is. form map of input opto output.

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:20 p.m.

now make list of output byte ranges. this can be done by starting at the start and summing the lengths.

then cut the lists so that there is no overlap at the edges. the byte ranges turn into sequences.

ok um we need the data ….. um

then we can swap the sequences one at a time to reorder the data in place. here also we’ll need to be sure the ranges update when the swap happens

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:21 p.m.

so basically when a swap happens, one range can overlap the other end of the swap. so we can simplify by swapping only part. maybe it makes sense to cut when the swap happens. but maybe for clarity we can cut first. ok ummm so this means looking in one of the inout/output lists, for the

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:22 p.m.

offsets of the other. that can be done with a binary search, which is the bisect module in python. this was probably where the inhibition was stemming from. now it’s migrated to implementation.

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:22 p.m.

algorithms are just ways to do things

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:24 p.m.

it’s nice to 8mplement it while engaging it ao that migration is less a thing

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

9:28 p.m.

...
then we can swap the sequences one at a time to reorder the data in place. here also we’ll need to be sure the ranges update when the swap happens

so different regions may go to different olaces, si in oython if the data is held as a list of lists, the entries in the lists can be swapped to propagate the references. the input and output data can hold separate items, or identical items. it seems simplest if both lists are swapped, for four total databchanges, but i suppose it makes sense to only swap the inout list, which then represents how the data currently is, and slowly turns into the output list …. i think? kind of. i think a misordered output list. a couple spots unchecked. seems reasonable to try to implement. happy made progress thinking on. not sure ehat to do next.

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

20 Jan 20 Jan

3:14 a.m.

put it together: offsets = list of all linebreaks then we form list of all byte ranges from the linebreaks then sort list of byte ranges. now make list of output byte ranges. this can be done by starting at the start and summing the lengths. then cut the lists so that there is no overlap at the edges. this means looking in one of the inout/output lists, for the offsets of the other while walking through it. that can be done with a binary search, which is the bisect module in python. the byte ranges turn into sequences. then we can swap the sequences one at a time to reorder the data in place. it makes sense to only swap the input list, which then represents how the data currently is, and slowly turns into a misordered output list. —> a problem remains: how to look up offsets to swap out written areas, when the regions are fragmented,p. hence, the working list is not ordered by lines, but rather is an ordered list of regions by offset, and another list gives the regions for each line. i think you do it with an output list and a working list of ordered regions.

Reply

Sign in to reply online Use email software

Undescribed Horrific Abuse, One Victim & Survivor of Many

3:15 a.m.

task: swap out region start at offset 0. say we have all the lists. using all of them is similar to considering which one to use.

Reply

Sign in to reply online Use email software

940

Age (days ago)

941

Last active (days ago)

Download

10 comments

1 participants

tags

participants (1)

Undescribed Horrific Abuse, One Victim & Survivor of Many