alternative algorithm #191

Vilhelm-Ian · 2024-03-17T18:23:05Z

Vilhelm-Ian
Mar 17, 2024

Continuing the discussion in this issue #189

xofm31 · 2024-03-19T02:35:33Z

xofm31
Mar 19, 2024

@Vilhelm-Ian I'm studying Chinese, so I don't have to worry about morphs vs lemmas. Is your desire that all morphs for the same lemma get treated the same in all cases (for calculating frequency, for recalc, other?) Is the lemma always a real word? What language are you learning? If it's not Japanese, we'll need to make sure that whatever gets implemented also works for Japanese.

My current thoughts on calculating the card score are to take a combination of

Frequency penalty: Line # from the frequency file for the unknown morph(s) on the card; even more if not in the frequency file
Length penalty: how far it is from the target # of morphs
Reinforce a morph that is in the learning stage: this one I found in the Morphman comments, and it seems like a good idea in theory, but looking into this made me want to look at how the interval is calculated...
Number of Unknowns penalty: I like using this penalty at 100,000 as Morphman does, because then all of the cards with 1 unknown show up in the 100,000's, all the ones with 2 unknowns are in the 200,000's, etc.

I could be convinced that there are better ways, but my main goal is to focus on getting the most frequent morphs first.

# 1 is different from the Ankimorphs difficulty calculation, because the frequency only is taken into account for the unknown morph(s). Ankimorphs difficulty adds up the frequency of all of the morphs in the sentence, which means that it there is a large effect of the other words in the sentence, and short sentences are prefered more than frequent morphs.

# 2 tries to find cards that have close to that number of morphs. Right now I have it set to 5, but I'm not sure how good that is. The basic idea is to avoid cards that are too short (where you don't get context) and too long (where you get overwhelmed with reading the card).

If we are trying to get a single algorithm, I think either emphasizing the length a lot might work for mortii, or using the Ankimorph difficulty and then scaling it back might work.

Assuming that you just want "treat all morphs with a given lemma the same", I think we'd just have to figure out how to do that, and then if there were a flag to turn this on and off, we could make it work with the same algorithm.

0 replies

Vilhelm-Ian · 2024-03-19T06:11:42Z

Vilhelm-Ian
Mar 19, 2024
Author

I am learning german.

Morphman had a check box that allowed you to treat all lemmas as the same.

In the past I've used morphman to learn japanese. And even then I found the option really useful. But with german is crucial. One word because of thecases and singular, plural forms, tenses can have many many forms.

The first thing you are trying to achieve. I think it's desirable to achieve a sentence with the most common possible words. Since that would make the sentence more comrphensible.
If I learn the word shovel.
I would rather get a sentence like:
It's important to bring a shovel to throw dirt on the fire if necessary.
than:
The decision made, everybody grabbed a shovel and started digging under their wagon.
Even if I know the word waggon. It would be more laborous to parse the sentence in my head.

Morphman had an option that would either skip or deprioritize(don't remember) cards that are too short or too long. I think that would fix both problems

I like the idea of prioritizing cards with recently learned morphs. I have been yearning for that feature for years. So long that I forgot about it.


    if morph.lemma_and_inflection not in morph_priority:
            # Heavily penalizes if a morph is not in frequency file
            difficulty = morph_unknown_penalty - 1

currently we penalize morphs even if there are know but not in the frequency list. Could we instead of penalizing, just skip them in the calculation. That would have the unintended effect of prioritizing words that are not in the frequency list over words that are.

As I am typing this I get what the issue is. What if we addded all the known morphs to the end of the frequency list. That way they won't be prioritized over words in the list and won't be penalized as much words that are not in the list and are not known.
An easer way to achieve that would be. For a word that is known and not in frequency list the penality to be the length of the frequency list + 1.

I would like to also mention this discussion "include grammar difficulty/usefulness #115 " maybe you will get some inspiration.

0 replies

mortii · 2024-03-19T10:39:27Z

mortii
Mar 19, 2024
Maintainer

I'm not going to interject into this discussion (unless I'm explicitly asked a question) because I think it would taint the process. I'm seeing interesting points already so this is great stuff! 👍

0 replies

xofm31 · 2024-03-20T01:24:49Z

xofm31
Mar 20, 2024

@mortii @Vilhelm-Ian "Treat all the different forms of a lemma the same" feels like it is separate from the card scoring (other than you want all of the forms to be considered the same for the purpose of scorint). To actually treat all the forms the same, you'd have to collapse them in every place they exist, including the frequency list. I guess that would require all of the place where the morph is used to be replaced with something that indicates if you're using the inflected form or the base form. Does that sound right to you?

If so, perhaps @Vilhelm-Ian could work on that?

The first thing you are trying to achieve. I think it's desirable to achieve a sentence with the most common possible words

This is in favor of using the Ankimorph way of calculating sentence difficulty. I'm liking the combination of this along with some constraints / preferences on sentence length, because I wouldn't want really short or really long sentence. I agree that we need to think carefully about how to count words that don't exist in the word frequency list.

I will look at the grammar thread again, but I think that integrating it will be beyond my ability.

2 replies

mortii Mar 20, 2024
Maintainer

"Treat all the different forms of a lemma the same" feels like it is separate from the card scoring (other than you want all of the forms to be considered the same for the purpose of scoring).

I think this: (other than you want all of the forms to be considered the same for the purpose of scoring) is exactly what @Vilhelm-Ian wants, so we can narrow it down to that. The only other place it would be applicable is the readability report, but that can be left as a separate issue imo.

To get this "lemma only" thing to work you would have to change the morph_priority dictionary (or its equivalent) to contain the lemmas as the keys, i.e. create a new version of this:

anki-morphs/ankimorphs/recalc.py

Lines 561 to 571 in 6c0b4b5

    
           def _get_morph_priority( 
        
               am_db: AnkiMorphsDB, 
        
               am_config_filter: AnkiMorphsConfigFilter, 
        
           ) -> dict[str, int]: 
        
               if am_config_filter.morph_priority_index == 0: 
        
                   morph_priority = am_db.get_morph_collection_priority() 
        
               else: 
        
                   morph_priority = _get_morph_frequency_file_priority( 
        
                       am_config_filter.morph_priority 
        
                   ) 
        
               return morph_priority

For the frequency files this would be pretty straight forward in that you could just read the Morph-lemma column and ignore/remove the duplicate values.

If so, perhaps @Vilhelm-Ian could work on that?

I'm going to immediately break my own rule and say this:
Absolutely divide and conquer, @Vilhelm-Ian should work on the implementation of this since it's something he is passionate about. Also, work on one specific piece at a time, and test it as soon as possible to get a better feel for how it would impact the result. It's a lot easier (and less mentally taxing) to put two puzzle pieces together instead of waiting to start until you have gathered all thousand pieces or whatever it might turn out to be.

mortii Mar 20, 2024
Maintainer

I will look at the grammar thread again, but I think that integrating it will be beyond my ability.

Yeah, don't work on this, we will leave that for later.

Vilhelm-Ian · 2024-03-20T04:59:15Z

Vilhelm-Ian
Mar 20, 2024
Author

@xofm31 another idea I had. What if in the calculation we don't count young morphs. That way they won't add to the difficulty score to the card so more likely to appear in new cards.

0 replies

mortii · 2024-03-22T22:55:08Z

mortii
Mar 22, 2024
Maintainer

I refactored difficulty to score a bunch of places, which might cause annoying merge problems if you guys have started working on this. If so, it might be better to create a fresh branch off of main instead of trying to merge, and just copy paste from your previous branch. Sorry for the inconvenience 🙏

0 replies

xofm31 · 2024-03-25T01:11:51Z

xofm31
Mar 25, 2024

I hadn't gotten very far, so I will take your advice to start from scratch. I've been really busy, so I haven't had a chance to spend much time on it.

I got sidetracked by wanting to bump up cards that had a word that was in the learning stage to reinforce those words. There was a separate discussion about the longest-interval, and I'd still like to investigate that a bit. But maybe I should table that to put together a first draft of a more flexible (and hopefully not overly complicated) score calculation.

1 reply

Vilhelm-Ian Mar 25, 2024
Author

I got sidetracked by wanting to bump up cards that had a word that was in the learning stage to reinforce those words

What do you think what I suggested not counting those words when calculating the score

xofm31 · 2024-03-27T11:55:32Z

xofm31
Mar 27, 2024

Here is a sketch of what I was thinking. https://github.com/xofm31/anki-morphs/blob/calc_score/ankimorphs/calc_score.py

The main idea is that for each of the criteria, you have a target, and then you have a penalty if the card misses that target. Each of the criteria has a weight. I was thinking that perhaps each of the penalties should be in its own function for readability purposes, but I think that would require looping through all of the morphs on the card multiple times. I defined the target values and weights at the top of the file, but if this were going to be integrated into Ankimorphs, at least some of those would have to be user selections in a menu.

    score = (
        usefulness_weight * usefulness_penalty
        + num_learning_weight * num_learning_penalty
        + num_morphs_weight * num_morphs_penalty
        + sentence_difficulty_weight * sentence_difficulty
    )

What do you think what I suggested not counting those words when calculating the score

I'm not sure exactly how this would be implemented. Is this part of the sentence_difficulty?

2 replies

Vilhelm-Ian Mar 27, 2024
Author

I'm not sure exactly how this would be implemented. Is this part of the sentence_difficulty?

Never mind I like your approach more.

Do you think we should make the weight adjustable. I am not sure if anybody other than me. Would appreciate that approach. Doing that would allow the user to prioritize if he wants to see longer sentcences, sentences with more learning cards, or anything else.

mortii Mar 28, 2024
Maintainer

We could probably add an Algorithm tab in the settings where the parameters could be adjusted. How rough is that draft by the way? Is it functional and giving you good scores?

xofm31 · 2024-03-29T00:49:38Z

xofm31
Mar 29, 2024

Is it functional and giving you good scores?

It is semi-functional. I haven't thought about all of the use cases, and I discovered today when I tried to use it on a frequency file rather than the Collection, that it crashed when the morph wasn't in the frequency list. I corrected it so now it sets it to the unknown morph score - 1.

It gives me approximately what I was hoping for, which is to always give the unknown morph that is highest on the frequency list, but order the cards with that morph according to desired length and having another morph that is in the learning stage. I set the weight for the difficulty of the sentence (which should be essentially your algorithm) to 0.

If you like this approach, then we'd need to work out the best way to score the different factors (for example, should there be one target length, or a range; how to measure the penalty for not being the right length). It might be tricky to figure out how to give options for a user to measure the relative weights of the different factors. I was thinking maybe your sentence-level difficulty could be changed to the average morph difficulty, which ought to make the scores more compatible with the scores from the "unknown morph usefulness". It would also mean that someone could opt for easy words but longer lengths.

I won't have much time to look at this over the next week or two.

1 reply

mortii Mar 29, 2024
Maintainer

Feel free to completely disregard everything I'm about to write since they are my personal preferences and I'm not the target audience for this algorithm.

If you like this approach, then we'd need to work out the best way to score the different factors (for example, should there be one target length, or a range; how to measure the penalty for not being the right length)

Yeah, this is tricky. My personal preference would be to have a bias towards shorter sentences, i.e. giving them less punishment, since I find them easier to learn. That is actually the reason I scrapped it from my algorithm, I found that it consistently defaulted to long and complicated sentences which were harder to learn than shorter sentences.

I was thinking maybe your sentence-level difficulty could be changed to the average morph difficulty, which ought to make the scores more compatible with the scores from the "unknown morph usefulness". It would also mean that someone could opt for easy words but longer lengths.

Sounds interesting! You should definitely experiment with that.

I won't have much time to look at this over the next week or two.

No worries, take your time. I kinda want to finish the study plan feature before really focusing on this, so that works out great for me too.

xofm31 · 2024-04-05T18:27:26Z

xofm31
Apr 5, 2024

I cleaned it up a bit. Compared to my initial draft:

The "difficulty" is now average morph difficulty
The desired sentence length (in morphs) is a range
For a morph not in the priority list, the penalty is no_morph_priority_value = len(morph_priority) + 1 (although I actually didn't test it)

I did get get shorter sentences by making the length penalty significantly higher than the difficulty penalty and having a desired sentence length of 0. But unless you only want short sentences, I think the ideal settings will vary greatly based on how long your frequency list is and how many morphs you already know. I can't think of a way to make the equations take this into account, other than using something like the average morph_priority of the top 100 unknown morphs. That would get ugly, and I'm not sure if it would work. This applies also with the usefulness of an unknown morph, but for me is not so much of an issue because I want it to always prioritize the usefulness, and just have the other criteria be secondary, basically to order the sentences with that morph.

Here's the branch:
https://github.com/xofm31/anki-morphs/blob/calc/ankimorphs/calc_score.py

Previously there was discussion about treating all inflections of the same lemma the same. I didn't look at that at all - I'm not sure if @Vilhelm-Ian has or not. Basically, I think it would require changing morph.lemma_and_inflection. To get the usefulness right, you'd probably also need to create a new frequency fille so that all the different forms would contribute to frequency of the same lemma (ex. walk, walking, walked all contribute to the same walk lemma frequency rather than to separate inflections).

1 reply

Vilhelm-Ian Apr 5, 2024
Author

i haven't gotten to it

mortii · 2024-04-17T12:22:46Z

mortii
Apr 17, 2024
Maintainer

I don't want to step on your toes @Vilhelm-Ian, but finishing this algorithm is now first priority, so I'll start implementing the lemma stuff now to get it done. Maybe you could help test it out eventually since you have a good understanding of the problem?

2 replies

Vilhelm-Ian Apr 17, 2024
Author

i am sorry very much morti. I have trouble managing my time. The lemma stuff you can just leave it to me when I get to it. Since I am the only one who has shown intrest in it

mortii Apr 18, 2024
Maintainer

@Vilhelm-Ian no, no, no, don't be sorry. In fact, I want to you not spend time on this if you can't fit it into your schedule, or if it's simply not something that you are excited to work on.

Since I am the only one who has shown intrest in it

This problem actually makes it impossible to learn Korean with AnkiMorphs (#201), so the severity of this has grown exponentially.

I will work on this, and if you could assist in the testing phase that would be awesome 👍

mortii · 2024-04-17T13:17:42Z

mortii
Apr 17, 2024
Maintainer

I feel like these should be in the reverse order:

card id: 1691325167622
card_morphs: ['お', 'から', 'た', 'だ', 'な', 'て', 'と', 'な', 'に', 'の', 'もの', 'もらっ', 'よう', '俺', '助け', '命', '子', '様', '館']
unknown_morphs: 1
num_learning_penalty: 4
num_morphs_penalty: 13
ave_difficulty_penalty: 89
score: 1002702

card id: 1691325167353
card_morphs: ['お', '様', '次回', '第', '話', '館']
unknown_morphs: 1
num_learning_penalty: 2
num_morphs_penalty: 0
ave_difficulty_penalty: 123
score: 1002713

maybe the num_morphs_weight * num_morphs_penalty should be be some exponential equation instead?

Also this is bugged:
https://github.com/xofm31/anki-morphs/blob/df2824443054e95aa17581fac67c495a716145bc/ankimorphs/calc_score.py#L107

it should be < not <=. It was fixed in ce55465 on the main repo.

Edit

I pushed the code and some of changes to the https://github.com/mortii/anki-morphs/tree/algorithm branch. Let's use this branch as the origin and then make pull-requests on that when we want to make changes. Don't worry about git the git history, we will clean that up at the end.

5 replies

mortii Apr 17, 2024
Maintainer

maybe the num_morphs_weight * num_morphs_penalty should be be some exponential equation instead?

using this the results are better imo:

ALL_MORPHS_DISTANCE_WEIGHT = 10

score = (
    TOTAL_PRIORITY_FOR_UNKNOWN_WEIGHT * total_priority_for_unknown
    + AVG_PRIORITY_WEIGHT * avg_priority
    + LEARNING_MORPHS_DISTANCE_WEIGHT * (learning_morphs_distance**2)
    + ALL_MORPHS_DISTANCE_WEIGHT * (all_morphs_distance**2)
)

xofm31 Apr 17, 2024

Thank you for doing this! Now I need to go back and remember what I did.

I'm pretty busy, but I assume that there is some time to work through the details of this.

Since the score incorporates the number of learning morphs, I'll plan to incorporate the interval stuff into this branch if that's OK.

My first update will be to update the description of the score with the new variable names. I put it in the algorithm branch and then do a pull request?

xofm31 Apr 17, 2024

I feel like these should be in the reverse order:

With the weights as given, it will almost always order a card with a more frequent unknown morph than a less frequent unknown morph. You will need to up your average priority weight.

TOTAL_PRIORITY_FOR_UNKNOWN_WEIGHT = 10
AVG_PRIORITY_WEIGHT = 1

If you set the average priority weight, total priority weight, and all morphs distance to 1, and set your target sentence length to 0, without the exponential, I think you should wind up with the same ordering as your current scoring algorithm. With the exponential, it will give a much bigger penalty for longer sentences.

mortii Apr 18, 2024
Maintainer

Ideally, i would like a piecewise punishment function which gives less punishment for undershooting the target length compared to overshooting it: https://www.geogebra.org/graphing/ythx8de5

At that point though, we might as well provide an input field where the user could specify entire functions, which is much more complicated...

mortii Apr 18, 2024
Maintainer

At that point though, we might as well provide an input field where the user could specify entire functions, which is much more complicated...

Actually, this might be genius... This is similar to what the FSRS add-on did previously, where you just paste in entire functions that produces the results you want.

mortii · 2024-04-17T17:50:37Z

mortii
Apr 17, 2024
Maintainer

@xofm31 Using names like "difficulty" and "usefulness" is not great because they are completely subjective, which essentially makes them opaque, so I renamed most of the identifiers.

After doing that, I see that the same thing is basically done twice, but I don't see the reason for it:

anki-morphs/ankimorphs/calc_score.py

Lines 88 to 93 in bc43fee

    
           if morph.highest_learning_interval == 0: 
        
               unknown_morphs.append(morph) 
        
               if morph.lemma_and_inflection in morph_priority: 
        
                   total_priority_for_unknown += morph_priority[morph.lemma_and_inflection] 
        
               else: 
        
                   total_priority_for_unknown += no_morph_priority_value

anki-morphs/ankimorphs/calc_score.py

Lines 99 to 103 in bc43fee

    
           if morph.lemma_and_inflection not in morph_priority: 
        
               # Heavily penalizes if a morph is not in frequency file 
        
               total_priority = no_morph_priority_value 
        
           else: 
        
               total_priority += morph_priority[morph.lemma_and_inflection]

and the score looks like this:

anki-morphs/ankimorphs/calc_score.py

Lines 113 to 118 in bc43fee

    
           score = ( 
        
               TOTAL_PRIORITY_FOR_UNKNOWN_WEIGHT * total_priority_for_unknown 
        
               + AVG_PRIORITY_WEIGHT * avg_priority 
        
               + LEARNING_MORPHS_DISTANCE_WEIGHT * learning_morphs_distance 
        
               + ALL_MORPHS_DISTANCE_WEIGHT * all_morphs_distance 
        
           )

what is being achieved by TOTAL_PRIORITY_FOR_UNKNOWN_WEIGHT * total_priority_for_unknown ?

Edit

Nvm, I realize you want to disentangle the unknowns from the rest of the sentence since you mostly care about the unknowns, that's fine.

0 replies

mortii · 2024-04-18T13:48:36Z

mortii
Apr 18, 2024
Maintainer

At that point though, we might as well provide an input field where the user could specify entire functions, which is much more complicated...

Actually, this might be genius... This is similar to what the FSRS add-on did previously, where you just paste in entire functions that produces the results you want.

Originally posted by @mortii in #191 (reply in thread)

Trying to come up with a general purpose algorithm is a fool's errand, and we should instead transition into making an api where people can specify their own algorithm. I'll make a sketch shortly.

EDIT:
Something like this maybe:

thoughts?

4 replies

xofm31 Apr 18, 2024

Trying to come up with a general purpose algorithm is a fool's errand,

I agree with this. I don't know what the solution is. Unfortunately, I do think that different people prefer different length sentences, and different people likewise care about the other criteria. For myself, I'd be happy to create my own equation, but I'm not sure if most people would want to.

mortii Apr 18, 2024
Maintainer

These fields will obviously have default values (the ones we use right now seem fine), that way people don't have to build their own algorithm from scratch if they don't want to.

mortii Apr 19, 2024
Maintainer

@xofm31 I just changed some parameters of the algorithm, could you test if you are able to adjust them to the point where you get the results you want?

xofm31 Apr 20, 2024

Yes this works for me.

mortii · 2024-04-18T17:35:50Z

mortii
Apr 18, 2024
Maintainer

So the default could be something like this:

and then this would be how I would personally modify it:

Edit:

^should be difference, not distance

11 replies

mortii Apr 21, 2024
Maintainer

I assume you don't have the move new cards [...] option activated in this scenario.

Say you have 10 consecutive cards with the same learning morph, e.g.:
card 1: I went to the shop
card 2: She went to the shop
...

The study schedule plays out like this:

Day one: card 1 (I went to the shop) is studied, the remaining 9 are buried
Day two: card 2 (She went to the shop) is studied, and the remaining 8 are buried
etc.

Now let's say that after studying the fifth card, the morph is no longer learning, but since the move new cards [...] option isn't activated, you will now just see five consecutive cards with the morph shop. What do you do at this point?

ghost Apr 21, 2024

It looks like i might have been confused about how the algorithm is working. I just tested with the move new cards [...] option disabled and got the behaviour i think i was describing, although i can't test for a morph that has just become known. Sorry for the confusion!

Then i have to ask, what's the difference between skip cards with only known morphs and move new cards without unknown morphs to the end of the due queue? I can't figure out from the documentation how the user should see a difference with them.

since the move new cards [...] option isn't activated, you will now just see five consecutive cards with the morph shop.

If you have skip cards with only known morphs enabled (which seems to be the default), i think it would make sense that these extra cards are skipped?

mortii Apr 21, 2024
Maintainer

Then i have to ask, what's the difference between skip cards with only known morphs and move new cards without unknown morphs to the end of the due queue? I can't figure out from the documentation how the user should see a difference with them.

Sorry about that, I'm bad at explaining things.

skip cards with only known morphs: Buries cards when they are encountered during a review
move new cards without unknown morphs to the end of the due queue: this moves them to the end of the queue, so they aren't encountered at all. This should render the skip option redundant.

Does that make sense?

skip cards with only known morphs enabled (which seems to be the default)

I don't remember when that happened, but maybe it shouldn't be... what do you think?

ghost Apr 21, 2024

I see, so when move new cards [...] is enabled, the state of skip card [...] makes no difference? I understand now, thanks! Perhaps it would be a good idea to disable this option if the other is enabled?

I don't remember when that happened, but maybe it shouldn't be... what do you think?

It's the behaviour of the restore defaults button. Personally i think it's fine to be enabled by default, as it's likely to be a feature a lot of people appreciate, and also demonstrates some considerable utility of the extension. But perhaps other people disagree.

mortii Apr 21, 2024
Maintainer

Perhaps it would be a good idea to disable this option if the other is enabled?

Makes sense, I'll add it to the todo list in #218

I don't remember when that happened, but maybe it shouldn't be... what do you think?

It's the behaviour of the restore defaults button

I meant that I wasn't sure when I made that the default, but I just checked, and it has been the same since the very start of the project.

Personally i think it's fine to be enabled by default

I agree 👍

mortii · 2024-05-18T09:53:09Z

mortii
May 18, 2024
Maintainer

Okay, I have a (very badly coded) test version that has implemented the new algorithm and an option to choose between lemma priority and inflection priority!

This version can be found in the latest algorithm branch, or you can download it from google drive (github doesn't like .addon files): ankimorphs-v3-0-0-testing-1

The algorithm "morph targets" stuff looks complicated, but all you do is define a range with no punishment, and then either side of that range you can specify a punishment curve (ax^2+bx+c), the default is this:

(graph link)

Any and all feedback would be very much appreciated!

0 replies

fuquasteve · 2024-05-18T14:16:52Z

fuquasteve
May 18, 2024

I installed the new version and noticed some different behavior. I have to disable the regular version of Ankimorphs to run it (lowercase new version, uppercase stable). What I expected. I left the Algorithm settings just as they installed. I included a screenshot. I had to run recalc twice to get the "shift cards that are not the first to have an unknown..." part to run on and move duplicates out. Since I had added the new fields into the notetype that ankimorphs uses, I had to upload a full sync of the decks thru ankiweb. Ankiweb immediately choked and spat the collection out as too big. I had a ridiculously big collection, 192000 cards anyway, so I deleted a bunch of stuff to get it down to a reasonable size. No problem, I had been meaning to do this anyway. I synced a again and I got some strange looking results in how things are sorting. I included a screenshot showing a few fields for the top priority card (no am-unknowns!), and another card where things may be working . I think I may have messed up my collection with my deletion of around 100000 cards to reduce the size of the the collection.

…

On Sat, May 18, 2024 at 2:53 AM mortii ***@***.***> wrote: Okay, I have a (very badly coded) test version that has implemented the new algorithm and an option to choose between lemma priority and inflection priority! This version can be found in the latest algorithm <https://github.com/mortii/anki-morphs/tree/algorithm> branch, or you can download it from google drive (github doesn't like .addon files): ankimorphs-v3-0-0-testing-1 <https://drive.google.com/file/d/1jMrVYax4aAXOTDYxMT1NMzXIuXL-_Mp_/view?usp=sharing> The algorithm "morph targets" stuff looks complicated, but all you do is define a range with no punishment, and then either side of that range you can specify a punishment curve (ax^2+bx+c), the default is this: Screenshot.from.2024-05-18.11-49-09.png (view on web) <https://github.com/mortii/anki-morphs/assets/15674619/913466c4-1057-4a15-9499-e7516f738e52> (graph link <https://www.geogebra.org/graphing/ta3eqb8y>) Any and all feedback would be very much appreciated! — Reply to this email directly, view it on GitHub <#191 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APG7PUPS6UJJ233U2SDWBZLZC4QJVAVCNFSM6AAAAABE2NC532VHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4TINZXG44TC> . You are receiving this because you were mentioned.Message ID: ***@***.***>

-- So often I have stood there flummoxed in a state of tartlement!

2 replies

mortii May 18, 2024
Maintainer

My bad, I should have mentioned that you should make backups of your collection before trying this version. Using a new profile is probably also a good idea, that way you don't have to worry about syncing the collection, among other things.

I included a screenshot.

I don't see it. Did you reply via email? Maybe images only works when inserting them directly on github.

I had to run recalc twice to get the "shift cards that are not the first to have an unknown..." part to run on and move duplicates out.

Hmm, I haven't tested it with this option, but I don't think it should be effected...

fuquasteve May 18, 2024

No problem. It's well backed up :-)

fuquasteve · 2024-05-18T14:25:23Z

fuquasteve
May 18, 2024

Oh, that m-unknowns column in my screenshot of my browser window is am-unknowns, of course.

…

On Sat, May 18, 2024 at 7:16 AM stephen fuqua ***@***.***> wrote: I installed the new version and noticed some different behavior. I have to disable the regular version of Ankimorphs to run it (lowercase new version, uppercase stable). What I expected. I left the Algorithm settings just as they installed. I included a screenshot. I had to run recalc twice to get the "shift cards that are not the first to have an unknown..." part to run on and move duplicates out. Since I had added the new fields into the notetype that ankimorphs uses, I had to upload a full sync of the decks thru ankiweb. Ankiweb immediately choked and spat the collection out as too big. I had a ridiculously big collection, 192000 cards anyway, so I deleted a bunch of stuff to get it down to a reasonable size. No problem, I had been meaning to do this anyway. I synced a again and I got some strange looking results in how things are sorting. I included a screenshot showing a few fields for the top priority card (no am-unknowns!), and another card where things may be working . I think I may have messed up my collection with my deletion of around 100000 cards to reduce the size of the the collection. On Sat, May 18, 2024 at 2:53 AM mortii ***@***.***> wrote: > Okay, I have a (very badly coded) test version that has implemented the > new algorithm and an option to choose between lemma priority and inflection > priority! > > This version can be found in the latest algorithm > <https://github.com/mortii/anki-morphs/tree/algorithm> branch, or you > can download it from google drive (github doesn't like .addon files): > ankimorphs-v3-0-0-testing-1 > <https://drive.google.com/file/d/1jMrVYax4aAXOTDYxMT1NMzXIuXL-_Mp_/view?usp=sharing> > > The algorithm "morph targets" stuff looks complicated, but all you do is > define a range with no punishment, and then either side of that range you > can specify a punishment curve (ax^2+bx+c), the default is this: > > Screenshot.from.2024-05-18.11-49-09.png (view on web) > <https://github.com/mortii/anki-morphs/assets/15674619/913466c4-1057-4a15-9499-e7516f738e52> > > (graph link <https://www.geogebra.org/graphing/ta3eqb8y>) > > Any and all feedback would be very much appreciated! > > — > Reply to this email directly, view it on GitHub > <#191 (comment)>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/APG7PUPS6UJJ233U2SDWBZLZC4QJVAVCNFSM6AAAAABE2NC532VHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4TINZXG44TC> > . > You are receiving this because you were mentioned.Message ID: > ***@***.***> > -- So often I have stood there flummoxed in a state of tartlement!

-- So often I have stood there flummoxed in a state of tartlement!

0 replies

fuquasteve · 2024-05-18T14:30:07Z

fuquasteve
May 18, 2024

I deleted the am-known-automatically tag and the am-ready tag from the collection and did a recalc, and nothing happened. I'll play with this some more (quietly). :-)

…

On Sat, May 18, 2024 at 7:25 AM stephen fuqua ***@***.***> wrote: Oh, that m-unknowns column in my screenshot of my browser window is am-unknowns, of course. On Sat, May 18, 2024 at 7:16 AM stephen fuqua ***@***.***> wrote: > I installed the new version and noticed some different behavior. > I have to disable the regular version of Ankimorphs to run it (lowercase > new version, uppercase stable). What I expected. > I left the Algorithm settings just as they installed. I included a > screenshot. > I had to run recalc twice to get the "shift cards that are not the first > to have an unknown..." part to run on and move duplicates out. > Since I had added the new fields into the notetype that ankimorphs uses, > I had to upload a full sync of the decks thru ankiweb. Ankiweb immediately > choked and spat the collection out as too big. I had a ridiculously big > collection, 192000 cards anyway, so I deleted a bunch of stuff to get it > down to a reasonable size. No problem, I had been meaning to do this > anyway. > I synced a again and I got some strange looking results in how things are > sorting. I included a screenshot showing a few fields for the top > priority card (no am-unknowns!), and another card where things may be > working . > > I think I may have messed up my collection with my deletion of around > 100000 cards to reduce the size of the the collection. > > On Sat, May 18, 2024 at 2:53 AM mortii ***@***.***> wrote: > >> Okay, I have a (very badly coded) test version that has implemented the >> new algorithm and an option to choose between lemma priority and inflection >> priority! >> >> This version can be found in the latest algorithm >> <https://github.com/mortii/anki-morphs/tree/algorithm> branch, or you >> can download it from google drive (github doesn't like .addon files): >> ankimorphs-v3-0-0-testing-1 >> <https://drive.google.com/file/d/1jMrVYax4aAXOTDYxMT1NMzXIuXL-_Mp_/view?usp=sharing> >> >> The algorithm "morph targets" stuff looks complicated, but all you do is >> define a range with no punishment, and then either side of that range you >> can specify a punishment curve (ax^2+bx+c), the default is this: >> >> Screenshot.from.2024-05-18.11-49-09.png (view on web) >> <https://github.com/mortii/anki-morphs/assets/15674619/913466c4-1057-4a15-9499-e7516f738e52> >> >> (graph link <https://www.geogebra.org/graphing/ta3eqb8y>) >> >> Any and all feedback would be very much appreciated! >> >> — >> Reply to this email directly, view it on GitHub >> <#191 (comment)>, >> or unsubscribe >> <https://github.com/notifications/unsubscribe-auth/APG7PUPS6UJJ233U2SDWBZLZC4QJVAVCNFSM6AAAAABE2NC532VHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4TINZXG44TC> >> . >> You are receiving this because you were mentioned.Message ID: >> ***@***.***> >> > > > -- > > So often I have stood there flummoxed in a state of tartlement! > > -- So often I have stood there flummoxed in a state of tartlement!

-- So often I have stood there flummoxed in a state of tartlement!

0 replies

fuquasteve · 2024-05-18T16:47:10Z

fuquasteve
May 18, 2024

Is it working the way it is supposed to? Is it giving me some cards with no am-unknowns that have morphs that are still in the learning stage?
This could be very useful.

1 reply

mortii May 18, 2024
Maintainer

Ah, interesting. Could you give some examples?

fuquasteve · 2024-05-18T19:29:43Z

fuquasteve
May 18, 2024

I'll try the images on GitHub. Added back some of the cards that I deleted, and the algorithm gave me a bunch of cards (maybe 40) without am-unknowns. My impression is they were very good cards, nice reviews. The cards I added back in were from a couple of Murakami books, which are often different from the the usual anime deck cards.

…

On Sat, May 18, 2024, 11:11 AM mortii ***@***.***> wrote: Ah, interesting. Could you give some examples? — Reply to this email directly, view it on GitHub <#191 (reply in thread)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APG7PUMV5KYGIPOX3G4ZCIDZC6KVJAVCNFSM6AAAAABE2NC532VHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4TIOBRGM4DM> . You are receiving this because you were mentioned.Message ID: ***@***.***>

0 replies

fuquasteve · 2024-05-18T19:37:53Z

fuquasteve
May 18, 2024

Let me try to attach the screenshots here.
I hope they make sense.

I hope this works.

1 reply

mortii May 19, 2024
Maintainer

@fuquasteve awesome, thanks for the feedback!

I just realized that I mixed up the default upper and lower target values, that will definitely cause some weird issues, I'll fix it asap.

fuquasteve · 2024-05-18T20:47:45Z

fuquasteve
May 18, 2024

Here are a couple more, focusing on the part of the screen that might be interesting.

0 replies

fuquasteve · 2024-05-18T21:03:01Z

fuquasteve
May 18, 2024

I think what is happening is that the cards without am-unknowns are not moving to the end of the deck.
Or the am-ready cards are getting pushed down too far.
This is for the card with the lowest location, the next card I will do tomorrow...

location after recalc:10000639

am-unknowns:0

am-score:2047483647

tags:am-ready, Bakuman_S02∷14

2 replies

mortii May 19, 2024
Maintainer

location after recalc:10000639
am-unknowns:0
am-score:2047483647

yeah, this is very weird. It almost certainly comes from using the "shift new cards [...]" option. I'll look into it.

mortii May 19, 2024
Maintainer

@fuquasteve I'm having problems reproducing the behaviour, could you share your settings?

Go to: Tools -> Add-ons -> ankimorphs -> "Config" button on the lower right sidebar, and then copy paste everything, .e.g.:

    "algorithm_all_morphs_target_distance": 1,
    "algorithm_average_priority_all_morphs": 0,
    "algorithm_inflection_priority": true,
    "algorithm_learning_morphs_target_distance": 5,
    "algorithm_lemma_priority": false,
    ...
    ...

fuquasteve · 2024-05-19T18:48:16Z

fuquasteve
May 19, 2024

I'll do that in a couple of hours. I did delete a bunch of cards just before I used the new version,and I wonder if I broke something...

…

On Sun, May 19, 2024, 9:12 AM mortii ***@***.***> wrote: @fuquasteve <https://github.com/fuquasteve> I'm having problems reproducing the behaviour, could you share your settings? Go to: Tools -> Add-ons -> ankimorphs -> "Config" button on the lower right sidebar, and then copy paste everything, .e.g.: "algorithm_all_morphs_target_distance": 1, "algorithm_average_priority_all_morphs": 0, "algorithm_inflection_priority": true, "algorithm_learning_morphs_target_distance": 5, "algorithm_lemma_priority": false, ... ... — Reply to this email directly, view it on GitHub <#191 (reply in thread)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APG7PUNFHFHRR5EVYSIB2NLZDDFP5AVCNFSM6AAAAABE2NC532VHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4TIOBYHA3TG> . You are receiving this because you were mentioned.Message ID: ***@***.***>

0 replies

fuquasteve · 2024-05-19T19:35:14Z

fuquasteve
May 19, 2024

Maybe mess d up tags or something...

…

On Sun, May 19, 2024, 11:47 AM stephen fuqua ***@***.***> wrote: I'll do that in a couple of hours. I did delete a bunch of cards just before I used the new version,and I wonder if I broke something... On Sun, May 19, 2024, 9:12 AM mortii ***@***.***> wrote: > @fuquasteve <https://github.com/fuquasteve> I'm having problems > reproducing the behaviour, could you share your settings? > > Go to: Tools -> Add-ons -> ankimorphs -> "Config" button on the lower > right sidebar, and then copy paste everything, .e.g.: > > "algorithm_all_morphs_target_distance": 1, > "algorithm_average_priority_all_morphs": 0, > "algorithm_inflection_priority": true, > "algorithm_learning_morphs_target_distance": 5, > "algorithm_lemma_priority": false, > ... > ... > > — > Reply to this email directly, view it on GitHub > <#191 (reply in thread)>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/APG7PUNFHFHRR5EVYSIB2NLZDDFP5AVCNFSM6AAAAABE2NC532VHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4TIOBYHA3TG> > . > You are receiving this because you were mentioned.Message ID: > ***@***.***> >

0 replies

fuquasteve · 2024-05-19T21:25:54Z

fuquasteve
May 19, 2024

Here is the result of
Tools -> Add-ons -> ankimorphs -> "Config" button on the lower right sidebar, and then copy paste everything,

I hope this helps

{
"algorithm_all_morphs_target_distance": 1,
"algorithm_average_priority_all_morphs": 0,
"algorithm_inflection_priority": true,
"algorithm_learning_morphs_target_distance": 5,
"algorithm_lemma_priority": false,
"algorithm_lower_target_all_morphs": 4,
"algorithm_lower_target_all_morphs_coefficient_a": 0,
"algorithm_lower_target_all_morphs_coefficient_b": 1,
"algorithm_lower_target_all_morphs_coefficient_c": 0,
"algorithm_lower_target_learning_morphs": 2,
"algorithm_lower_target_learning_morphs_coefficient_a": 1,
"algorithm_lower_target_learning_morphs_coefficient_b": 0,
"algorithm_lower_target_learning_morphs_coefficient_c": 0,
"algorithm_total_priority_all_morphs": 1,
"algorithm_total_priority_unknown_morphs": 10,
"algorithm_upper_target_all_morphs": 6,
"algorithm_upper_target_all_morphs_coefficient_a": 1,
"algorithm_upper_target_all_morphs_coefficient_b": 0,
"algorithm_upper_target_all_morphs_coefficient_c": 0,
"algorithm_upper_target_learning_morphs": 1,
"algorithm_upper_target_learning_morphs_coefficient_a": 1,
"algorithm_upper_target_learning_morphs_coefficient_b": 0,
"algorithm_upper_target_learning_morphs_coefficient_c": 0,
"filters": [
{
"extra_highlighted": true,
"extra_score": true,
"extra_score_terms": true,
"extra_unknowns": true,
"extra_unknowns_count": true,
"field": "Japanese",
"modify": true,
"morph_priority": "Collection frequency",
"morphemizer_description": "AnkiMorphs: Japanese",
"note_type": "Subs2srsAnkimorphs",
"read": true,
"tags": {
"exclude": [],
"include": []
}
},
{
"extra_highlighted": false,
"extra_score": false,
"extra_score_terms": false,
"extra_unknowns": false,
"extra_unknowns_count": false,
"field": "Jlab-Kanji",
"modify": false,
"morph_priority": "Collection frequency",
"morphemizer_description": "AnkiMorphs: Japanese",
"note_type": "JlabNote-JlabConverted-1+",
"read": true,
"tags": {
"exclude": [],
"include": []
}
},
{
"extra_highlighted": false,
"extra_score": false,
"extra_score_terms": false,
"extra_unknowns": false,
"extra_unknowns_count": false,
"field": "Jlab-Kanji",
"modify": false,
"morph_priority": "Collection frequency",
"morphemizer_description": "AnkiMorphs: Japanese",
"note_type": "JlabNote-JlabConverted-1++",
"read": true,
"tags": {
"exclude": [],
"include": []
}
},
{
"extra_highlighted": false,
"extra_score": false,
"extra_score_terms": false,
"extra_unknowns": false,
"extra_unknowns_count": false,
"field": "Jlab-Kanji",
"modify": false,
"morph_priority": "Collection frequency",
"morphemizer_description": "AnkiMorphs: Japanese",
"note_type": "JlabNote-JlabConverted-1-b1610",
"read": true,
"tags": {
"exclude": [],
"include": []
}
},
{
"extra_highlighted": false,
"extra_score": false,
"extra_score_terms": false,
"extra_unknowns": false,
"extra_unknowns_count": false,
"field": "Japanese",
"modify": false,
"morph_priority": "Collection frequency",
"morphemizer_description": "AnkiMorphs: Japanese",
"note_type": "KEEP-Japanese read_repeat_pronounce_ja-+++",
"read": true,
"tags": {
"exclude": [],
"include": []
}
}
],
"preprocess_ignore_bracket_contents": false,
"preprocess_ignore_names_morphemizer": false,
"preprocess_ignore_names_textfile": false,
"preprocess_ignore_round_bracket_contents": false,
"preprocess_ignore_slim_round_bracket_contents": false,
"preprocess_ignore_suspended_cards_content": false,
"recalc_due_offset": 500000,
"recalc_interval_for_known": 21,
"recalc_move_known_new_cards_to_the_end": true,
"recalc_number_of_morphs_to_offset": 100,
"recalc_offset_new_cards": true,
"recalc_on_sync": false,
"recalc_read_known_morphs_folder": false,
"recalc_suspend_known_new_cards": false,
"recalc_toolbar_stats_use_known": false,
"recalc_toolbar_stats_use_seen": true,
"recalc_unknowns_field_shows_inflections": true,
"recalc_unknowns_field_shows_lemmas": false,
"shortcut_browse_all_same_unknown": "Shift+L",
"shortcut_browse_ready_same_unknown": "L",
"shortcut_browse_ready_same_unknown_lemma": "Ctrl+Shift+L",
"shortcut_generators": "Ctrl+Shift+G",
"shortcut_known_morphs_exporter": "Ctrl+Shift+E",
"shortcut_learn_now": "Ctrl+Alt+N",
"shortcut_recalc": "Ctrl+M",
"shortcut_set_known_and_skip": "K",
"shortcut_settings": "Ctrl+O",
"shortcut_view_morphemes": "Ctrl+Alt+V",
"skip_only_known_morphs_cards": false,
"skip_show_num_of_skipped_cards": false,
"skip_unknown_morph_seen_today_cards": false,
"tag_known_automatically": "am-known-automatically",
"tag_known_manually": "am-known-manually",
"tag_learn_card_now": "am-learn-card-now",
"tag_not_ready": "am-not-ready",
"tag_ready": "am-ready"
}

0 replies

mortii · 2024-05-20T09:34:22Z

mortii
May 20, 2024
Maintainer

Released a new testing build in the v3 megathread (#222)

New testing build: 3.0.0-testing-2 (google drive)

Changelog

"lower morph targets" settings are now capped to always be lower than the "upper morph targets"

@fuquasteve The bug looks pretty bad, and I don't think it's because you did anything wrong, so it should be fixed. I'm not able to reproduce it with my card collection, so could you potentially share yours? If you go to your anki profile folder there is a file called "collection.anki2", if you upload that to google drive, or any other file sharing platform where I could download it, that would be amazing 🙏

3 replies

fuquasteve May 20, 2024

Here is a link to it https://drive.google.com/file/d/1YnrGa0zjO6rhtOY0whHdusNY7J7Gwgj1/view?usp=sharing
Let me know if you need anything else.
I may have hidden the problem last night.
I wanted to do my reviews, so I did two recalcs with the move cards options turned off.
Then I did a recalc with only the move cards with only known cards, and another recalc with both the move cards with only known cards and the move cards with repeat morphs.
The deck looked OK then.
I hope this is useful for you.

mortii May 20, 2024
Maintainer

@fuquasteve thanks!

I'm not getting unexpected values for cards that have no unknown morphs, they are all given max score:

Let me know if you experience it again, and hopefully we will be able to debug it then.

Vilhelm-Ian Jun 19, 2024
Author

@mortii tried it out. I tried it with the lemma option. And it prioritiezes cards where the lemma is known even more.

mortii · 2024-06-20T09:35:12Z

mortii
Jun 20, 2024
Maintainer

Released a new test version:

New testing build: 3.0.0-testing-3 (google drive)

Alterantively, checkout the known-lemma branch.

Changelog

if "Morph Prioirity: Lemma" is selected:
- cards can now be skipped on review if the morphs lemmas are already known
- all inflections are set to known during recalc if their lemma is known

In future versions the "Morph Prioirity: Lemma" option will be renamed, and it will be moved to a new "General" tab in the settings.

Originally posted by @mortii in #222 (comment)

@Vilhelm-Ian does this build work better? It includes the changes discussed in #141.

1 reply

Vilhelm-Ian Jun 23, 2024
Author

it works like I expected.

THANK YOU THANK YOU YALL

mortii · 2024-08-02T19:07:00Z

mortii
Aug 2, 2024
Maintainer

Included in the v3 update: https://github.com/mortii/anki-morphs/releases/tag/v3.0.0

Thank you all!

0 replies

alternative algorithm #191

Uh oh!

Replies: 36 comments · 59 replies

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Vilhelm-Ian Mar 19, 2024 Author

Uh oh!

mortii Mar 19, 2024 Maintainer

Uh oh!

Uh oh!

mortii Mar 20, 2024 Maintainer

Uh oh!

mortii Mar 20, 2024 Maintainer

Uh oh!

Uh oh!

Vilhelm-Ian Mar 20, 2024 Author

Uh oh!

Uh oh!

mortii Mar 22, 2024 Maintainer

Uh oh!

Uh oh!

Vilhelm-Ian Mar 25, 2024 Author

Uh oh!

Uh oh!

Uh oh!

Vilhelm-Ian Mar 27, 2024 Author

Uh oh!

Uh oh!

mortii Mar 28, 2024 Maintainer

Uh oh!

Uh oh!

mortii Mar 29, 2024 Maintainer

Uh oh!

Uh oh!

Vilhelm-Ian Apr 5, 2024 Author

Uh oh!

Uh oh!

mortii Apr 17, 2024 Maintainer

Uh oh!

Uh oh!

Vilhelm-Ian Apr 17, 2024 Author

Uh oh!

mortii Apr 18, 2024 Maintainer

Uh oh!

Uh oh!

mortii Apr 17, 2024 Maintainer

Edit

Uh oh!

mortii Apr 17, 2024 Maintainer

Uh oh!

Uh oh!

Uh oh!

mortii Apr 18, 2024 Maintainer

Uh oh!

Replies: 36 comments 59 replies

Vilhelm-Ian
Mar 19, 2024
Author

mortii
Mar 19, 2024
Maintainer

mortii Mar 20, 2024
Maintainer

mortii Mar 20, 2024
Maintainer

Vilhelm-Ian
Mar 20, 2024
Author

mortii
Mar 22, 2024
Maintainer

Vilhelm-Ian Mar 25, 2024
Author

Vilhelm-Ian Mar 27, 2024
Author

mortii Mar 28, 2024
Maintainer

mortii Mar 29, 2024
Maintainer

Vilhelm-Ian Apr 5, 2024
Author

mortii
Apr 17, 2024
Maintainer

Vilhelm-Ian Apr 17, 2024
Author

mortii Apr 18, 2024
Maintainer

mortii
Apr 17, 2024
Maintainer

mortii Apr 17, 2024
Maintainer

mortii Apr 18, 2024
Maintainer