No slugify of multiselect answers #181

johanneswilm · 2023-03-07T12:46:12Z

Remove slugify.

This PR builds on top of #180 and removes the slugification of answers to multiselect questions. Without this applied, it's not possible to dumpdata/loaddata of pre-existing user data.

… 3.11

This reverts commit 223c9d5.

johanneswilm · 2023-03-07T14:38:42Z

survey/forms.py

-                initial = []
                if answer.body == "[]":
-                    pass
-                elif "[" in answer.body and "]" in answer.body:


@Pierre-Sassoulas I don't fully understand what the idea was here. It looks to me like a "]" and a "[" anywhere in the answer.body would cause this to be initiated for example ("Subtitled [English]"). Maybe that's why you decided to slugify? To make sure there wouldn't be any "[" nor "]" in there?

However, slugifying means that unique choices can be come none unique. If choices are something like "<40" and ">40" then both are translated into "40"

Possibly, I remember this was a bit hacky, we're serializing / deserializing a list of answers here.

johanneswilm · 2023-03-07T14:45:43Z

survey/forms.py

-                    unformated_choices = answer.body[1:-1].strip()
-                    for unformated_choice in unformated_choices.split(settings.CHOICES_SEPARATOR):
-                        choice = unformated_choice.split("'")[1]
-                        initial.append(slugify(choice))


Here I was wondering why the contents of answer.body are not just JSON or some other standardized method. That way one doesn't run risks of this mechanism breaking as one can fall back to existing json deserialization methods.

It's not quite json as of now because it's using single quotation marks rather than double quotation marks. But it's already quite close.

My code below doesn't change that but I think it's slightly more robust in some cases.

This is a small sql database, so I suppose I wanted to avoid dumping a json.

A standardized method (possibily json) would indeed be better than this custom parsing.

@Pierre-Sassoulas Ok, but it wouldn't take up any more space in the database. Instead of

['one', 'two', 'three']

in JSON that would simple be:

["one", "two", "three"]

An advantage would be that it would automatically be able to read things like this as well:

["'one'", "t,w,o", "[three]"]

I think my code below can do most of that as well though. So maybe it's not needed.

I spent a day last week cleaning up a ~90k row dump turning everything back from the slugified version to the original options because loaddata wouldn't accept the data before realizing that I needed to override this code as well to actually make the original data work. That's the reason I raise this.

Sounds like it's going to be hard to make a migration from slugified to the new value (as there is less information in slugified answer than the actual answers) so putting that in a breaking change for version 2.0.0 and removing all the migrations files seems in order, what do you think ?

Or maybe create a "best effort" migration file ?

It should be possible to make a data migration file in which one takes all the options, converts them using slugify and that way make a pretty good guess as to which of the original options is the right one. Two cases where I don't think that's possible:

If the options have changed after some respondents answered. The slugified version will either correspond to none of the current choices or may even correspond to an incorrect one.

Cases where two or more options turn into the same slugified value.

So yeah, probably major version change and some way to gracefully handle the two above situations telling users that "sorry, some data was lost and we can no longer figure out what your respondents actually answered".

Sounds good !

jonafato · 2025-10-24T15:46:43Z

@Pierre-Sassoulas @johanneswilm I know this PR is a couple years old now, but I wanted to see if there was anything I could help with to get this resolved and planned for release. From the discussion here, it sounds like there are a couple of factors to consider:

It sounds like this is already being considered a major version change, so perhaps we can set it for the 2.0.0 milestone and offer pre-release versions for testing purposes.
Some best-effort migration process would be helpful, though it may not always be possible to cleanly migrate old data to the new format, and developers using this migration should be informed about its limitations. Would a (temporary) management command offer more flexibility than a migration file for this purpose? e.g. this might allow the process to be more interactive and / or signal cases where the migration script knows that it cannot be certain of the migration and allow someone to decide whether and how to proceed. (I suppose these cases are somewhat similar to resolving git merge conflicts, where a person may be able to make reasonable decisions that cannot be safely made automatically.)

Pierre-Sassoulas · 2025-10-24T16:05:21Z

Releasing a major version is not an issue, feel free to do It.b It's probably possible to match with high certainty what the unsluggified answer is with difflib.get_close_match and worse case a user input to choose so the migration is 'perfect'. But maybe it's more reasonable to make a best effort migration.

johanneswilm added 4 commits March 7, 2023 13:13

Don't slugify multiple choice choices and make compatible with Python…

99f6fa2

… 3.11

upgrade python/djangon versions in tox.ini

3746ea8

Readd slugify

223c9d5

Remove slugify

d54133d

This reverts commit 223c9d5.

johanneswilm mentioned this pull request Mar 7, 2023

Update to Python 3.11/pyproject.toml/current django versions #180

Merged

Pierre-Sassoulas added the bug label Mar 7, 2023

Pierre-Sassoulas self-requested a review March 7, 2023 12:55

Pierre-Sassoulas added this to the 1.4.7 milestone Mar 7, 2023

Merge branch 'main' into no-slugify

89f2f7f

johanneswilm commented Mar 7, 2023

View reviewed changes

Pierre-Sassoulas modified the milestones: 1.4.7, 1.4.8 Feb 23, 2024

Pierre-Sassoulas removed this from the 1.4.8 milestone Oct 31, 2024

Uh oh!

No slugify of multiselect answers #181

Are you sure you want to change the base?

No slugify of multiselect answers #181

Uh oh!

Conversation

johanneswilm commented Mar 7, 2023

Uh oh!

johanneswilm Mar 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Pierre-Sassoulas Mar 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

johanneswilm Mar 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jonafato commented Oct 24, 2025

Uh oh!

Pierre-Sassoulas commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

johanneswilm Mar 7, 2023 •

edited

Loading

Pierre-Sassoulas Mar 7, 2023 •

edited

Loading

johanneswilm Mar 7, 2023 •

edited

Loading