next: Changes queued for 2017.11 release #1250

lukego · 2017-11-17T11:33:30Z

This release will follow quickly on the heels of the much-delayed v2017.09 release.

Initialisation of the shm frame needs to be done in the constructor method to avoid being overwritten by the frame created in core.app. Add a copy of the mtu to the device-specific stats table, add a counter to record the speed of the interface.

Avoid using index sets and just query the registers since it's hard to share this state among the apps. Also do the MAC registration in the right place.

RSS is always enabled, so there shouldn't be a case in which this is a no-op anyway. Also the check is complicated by the fact that RSS can be enabled in many ways (with DCB, VMDq, etc.).

Also add code to unset MAC on stop()

This code is mostly copied from the intel10g.lua driver. Also add a new VLAN test and adjust old tests as needed.

The previous iteration was unreliable for unknown reasons. This refactoring should be clearer and run more reliably.

Also add new test for VLAN stripping

* PFVFTE to enable Tx from appropriate VFs * RTTD1TC for bandwidth allocation algorithm for Tx

The former wasn't updated for an API change and the latter was missing a module declaration.

The tag insertion part of this test didn't work because the MAC didn't match when the app tried to resend the packet. It's not necessary since the transmit test covers this anyway.

Use intel_mp driver instead of intel10g driver

…ate-nov2017

lukego · 2017-11-21T11:04:14Z

Huzzah! The new intel_mp driver is now adopted as the default and the intel10g driver (which represents some of the first lines of code ever written in Snabb) becomes a legacy backup option.

Great hacking to all the many intel_mp hackers !!! 👍

`mode` was not passed to `io.open`.

Set mask.size equals to MAX_NUMNODES if lower

lukego · 2017-12-05T08:35:21Z

Sorry about the wait. I wanted to recommend this for release now but I see a performance regression that needs to be resolved. I will first check if this is related to overuse of trace barriers in #1242.

lukego · 2017-12-06T08:39:38Z

Confirmed that the performance regression is due to the JIT barrier. The lukego-optimize performance with that change reverted matched master again. I will test using fewer barriers (on entry or exit to an app, but not both) and see if that is better. Otherwise the JIT barrier might not be suitable (too expensive) for app/engine transitions.

lukego · 2017-12-08T08:18:41Z

I have reverted the calls to jit.tracebarrier() around app callbacks. These seem to be a bit of a performance drag overall and their benefit is not really established.

So the jit.tracebarrier() primitive still exists but the engine doesn't currently use it.

Let's wait for the standard Hydra tests to complete now and then we should be 👍 for the release.

lukego · 2017-12-11T09:55:02Z

@eugeneia There is a performance regression on the iperf benchmark but I propose that we ship this now anyway.

I have been combing through recent CI results with @wingo on Slack and it seems like the problem is caused by voodoo. I have another branch with almost identical contents that does not show the issue. The only difference is whether the jit.tracebarrier() primitive exists in the C code. Having that code present seems to provoke the problem even though it is never called.

I am reluctant to make a "nonsense" change to "solve" this problem. I would prefer to accept it for now and focus on finding the root cause of why we see variance in the iperf benchmark. I see the new RaptorJIT tooling as the way to do this and so I want to spend my time now on integrating that with Snabb. Hence my willingness to accept this symptom of the root problem (wider variance on the iperf benchmark) for the moment.

Hypothetically if the problem is something obscure, like whether two Lua loop bytecode addresses hash into the same JIT hot counter, then there are probably very many different ways that it could be provoked (e.g. choice of C compiler version) and so I am not really confident that nailing down one such issue in the test environment would translate into a real world benefit. This would need to be solved more thoroughly in the JIT after seeing exactly what is really going on.

WDYT?

eugeneia · 2017-12-11T16:45:33Z

Totally agree. I think #1244 is not included in this release, or is it? Maybe, as a last resort I would test to see if it resolves this regression like it did mine. This doesn’t block the release from my point of view though. Ready when you are!

lukego · 2017-12-11T17:51:14Z

@eugeneia Glad to hear that this change has helped you. I pushed it here and let's see what Hydra says tomorrow.

lukego · 2017-12-12T09:38:46Z

@eugeneia This was worth a try but the new benchmark results show the same issue: https://hydra.snabb.co/build/2822290/download/2/report.html#iperf.

wingo · 2017-12-12T12:54:59Z

In a way that's good to know that the results are the same. I was unsure whether to blame a voodoo change to the software itself or some problem with the statistics!

alexandergall and others added 30 commits July 28, 2017 08:32

First attempt at VMDq for intel_mp

efbb5e7

Fix up VMDq mode so that it actually runs

60474bc

First attempt at tests for VMDq mode

19edfb1

Try testing different MAC addrs with VMDq

70bd72d

Fix MAC address registration in VMDq mode

284ba9f

Avoid using index sets and just query the registers since it's hard to share this state among the apps. Also do the MAC registration in the right place.

Add comment explaining some RSS code

70968dd

Enable RSS queues via PSRTYPE properly for VMDq

32857a2

Use :bits to set the VMDq RSS mode properly

aca0bf3

Remove RSS bit check for setting RETA

cc02601

RSS is always enabled, so there shouldn't be a case in which this is a no-op anyway. Also the check is complicated by the fact that RSS can be enabled in many ways (with DCB, VMDq, etc.).

Fix RTTPCS bit set that had a wrong length arg

12a55f0

Adjust test to test that both vmdq pools get pkts

162eac0

Move MAC pool enable code and make it cross-NIC

97e3634

Also add code to unset MAC on stop()

Adjust the VMDq initialization assertions

09c3bc6

Error when the max number of MAC addresses is reached

6240e9a

Add code for enabling VLAN filtering/tagging

cf044c7

This code is mostly copied from the intel10g.lua driver. Also add a new VLAN test and adjust old tests as needed.

Refactor vmdq test script

e314237

The previous iteration was unreliable for unknown reasons. This refactoring should be clearer and run more reliably.

Enable VLAN takedown code for VMDq

88c0810

Refactor intel_mp recv test scripts to reduce duplication

c3b58d0

Enable VLAN tag stripping in VMDq mode

6991cce

Also add new test for VLAN stripping

Adjust VLAN test to also test tag insertion

432e35a

VMDq settings for transmit queues

ec0c436

Add a test for VLAN tag insertion & VMDq tx

c303d30

Set registers needed to make Tx test work

45a7494

* PFVFTE to enable Tx from appropriate VFs * RTTD1TC for bandwidth allocation algorithm for Tx

Fix assertion for Rx queue num to allow nil

e8a1c36

Adjust intel_mp to error with VMDq on non-82599

e1766fb

Remove unnecessary helper function

8e07ff3

Fix permissions on two intel_mp tests

67420dd

Fix 1q vmdq test and testrecv.lua

e6f2b05

The former wasn't updated for an API change and the latter was missing a module declaration.

Adjust VLAN test to avoid testing tag insertion

5dc4dcd

The tag insertion part of this test didn't work because the MAC didn't match when the app tried to resend the packet. It's not necessary since the transmit test covers this anyway.

lukego and others added 10 commits November 16, 2017 13:02

Merge #1231 branch 'snabbco/wingo-next' into next

36a44e9

Merge missed commits from #1231 branch 'snabbco/wingo-next' into next

59de571

Merge pull request #1237 from Igalia/migrate-to-intel-mp

bc50a0c

Use intel_mp driver instead of intel10g driver

Merge #1190 branch 'alexg/intel-mp-shm' into next

f97d74b

Merge #1225 branch 'alexg/siphash-ctype-diversity' into lukego-integr…

6cba381

…ate-nov2017

Merge #1240 branch 'krawthekrow/fixes' into lukego-integrate-nov2017

c178ba9

Merge #151 branch 'lukego-integrate-nov2017' into next

804db04

Merge #1249 branch 'snabbco/wingo-next' into next

cd29714

Merge #1242 branch 'lukego/jit-tracebarrier' into next

e8d0af7

Merge #1245 branch 'eugeneia/log-segfault-address' into next

807fb4e

Fabian Bonk and others added 7 commits November 22, 2017 14:52

Fix file mode

c76a991

`mode` was not passed to `io.open`.

Set mask.size equals to MAX_NUMNODES if lower

245e0f0

Rewrite get_maxnumnodes function to not depend on a fixed filesize

a2fd55f

Rework readfile function and add comment on how to calculate maxnumnodes

776601a

Merge pull request #1228 from dpino/fix-get_mempolicy

22e5b79

Set mask.size equals to MAX_NUMNODES if lower

Merge #1255 branch 'snabbco/wingo-next' into next

245b75d

Merge #1254 branch 'Reperator/master' into next

2d790f5

engine: Remove jit.barrier() calls around apps (too expensive)

50ced99

Merge #1244 branch 'lukego/record-blacklisted-functions' into next

6c0f065

eugeneia merged commit 6c0f065 into master Dec 12, 2017

eugeneia added a commit that referenced this pull request Dec 12, 2017

Merge PR #1250 (v2017.11 release) into master

248aae7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

next: Changes queued for 2017.11 release #1250

next: Changes queued for 2017.11 release #1250

Uh oh!

lukego commented Nov 17, 2017

Uh oh!

lukego commented Nov 21, 2017

Uh oh!

lukego commented Dec 5, 2017

Uh oh!

lukego commented Dec 6, 2017

Uh oh!

lukego commented Dec 8, 2017

Uh oh!

lukego commented Dec 11, 2017 •

edited

Loading

Uh oh!

eugeneia commented Dec 11, 2017

Uh oh!

lukego commented Dec 11, 2017

Uh oh!

lukego commented Dec 12, 2017

Uh oh!

wingo commented Dec 12, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

next: Changes queued for 2017.11 release #1250

next: Changes queued for 2017.11 release #1250

Uh oh!

Conversation

lukego commented Nov 17, 2017

Uh oh!

lukego commented Nov 21, 2017

Uh oh!

lukego commented Dec 5, 2017

Uh oh!

lukego commented Dec 6, 2017

Uh oh!

lukego commented Dec 8, 2017

Uh oh!

lukego commented Dec 11, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eugeneia commented Dec 11, 2017

Uh oh!

lukego commented Dec 11, 2017

Uh oh!

lukego commented Dec 12, 2017

Uh oh!

wingo commented Dec 12, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

lukego commented Dec 11, 2017 •

edited

Loading