All notable changes to VOLK will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning, starting with version 2.0.0.
This is the first version to use semantic versioning. It starts the logging of changes.
Hi everyone,
we would like to announce that Michael Dickens and Johannes Demel are the new VOLK maintainers. We want to review and merge PRs in a timely manner as well as commenting on issues in order to resolve them.
We want to thank all contributors. This release wouldn't have been possible without them.
We're curious about VOLK users. Especially we'd like to learn about VOLK users who use VOLK outside GNU Radio.
If you have ideas for VOLK enhancements, let us know. Start with an issue to discuss your idea. We'll be happy to see new features get merged into VOLK.
VOLK v2.1.0 is a collection of really cool changes. We'd like to highlight some of them.
- The AVX FMA rotator bug is fixed
- VOLK offers
volk::vector<>
for C++ to follow RAII - Move towards modern dependencies
- CMake 3.8
- Prefer Python3
- We will drop Python2 support in a future release!
- Use C++17
std::filesystem
- This enables VOLK to be built without Boost if available!
- more stable CI
- lots of bugfixes
- more optimized kernels, especially more NEON versions
- Albin Stigo [email protected]
- Amr Bekhit [email protected]
- Andrej Rode [email protected]
- Carles Fernandez [email protected]
- Christoph Mayer [email protected]
- Clayton Smith [email protected]
- Damian Miralles [email protected]
- Johannes Demel [email protected] [email protected]
- Marcus Müller [email protected]
- Michael Dickens [email protected]
- Philip Balister [email protected]
- Takehiro Sekine [email protected]
- Vasil Velichkov [email protected]
- luz.paz [email protected]
-
Usage
- Update README to reflect how to build on Raspberry Pi and the importance of running volk_profile
-
Toolchain
- Add toolchain file for Raspberry Pi 3
- Update Raspberry 4 toolchain file
-
Kernels
- Add neonv7 to volk_16ic_magnitude_16i
- Add neonv7 to volk_32fc_index_max_32u
- Add neonv7 to volk_32fc_s32f_power_spectrum_32f
- Add NEONv8 to volk_32f_64f_add_64f
- Add Neonv8 to volk_32fc_deinterleave_64f_x2
- Add volk_32fc_x2_s32fc_multiply_conjugate_add_32fc
- Add NEONv8 to volk_32fc_convert_16ic
-
CI
- Fix AVX FMA rotator
- appveyor: Enable testing on windows
- Fixes for flaky kernels for more reliable CI
- volk_32f_log2_32f
- volk_32f_x3_sum_of_poly_32f
- volk_32f_index_max_{16,32}u
- volk_32f_8u_polarbutterflypuppet_32f
- volk_8u_conv_k7_r2puppet_8u
- volk_32fc_convert_16ic
- volk_32fc_s32f_magnitude_16i
- volk_32f_s32f_convert_{8,16,32}i
- volk_16ic_magnitude_16i
- volk_32f_64f_add_64f
- Use Intel SDE to test all kernels
- TravisCI
- Add native tests on arm64
- Add native tests on s390x and ppc64le (allow failure)
-
Build
- Build Volk without Boost if C++17 std::filesystem or std::experimental::filesystem is available
- Update to more modern CMake
- Prevent CMake to choose previously installed VOLK headers
- CMake
- bump minimum version to 3.8
- Use sha256 instead of md5 for unique target name hash
- Python: Prefer Python3 over Python2 if available
-
C++
- VOLK C++ allocator and C++11 std::vector type alias added \n
Hi everyone,
we have a new VOLK release v2.2.0!
We want to thank all contributors. This release wouldn't have been possible without them.
We're curious about VOLK users. Especially we'd like to learn about VOLK users who use VOLK outside GNU Radio.
If you have ideas for VOLK enhancements, let us know. Start with an issue to discuss your idea. We'll be happy to see new features get merged into VOLK.
The v2.1.0 release was rather large because we had a lot of backlog. We aim for more incremental releases in order to get new features out there.
VOLK v2.2.0 updates our build tools and adds support functionality to make it easier to use VOLK in your projects.
- Dropped Python 2 build support
- Removed Python six module dependency
- Use C11 aligned_alloc whenever possible
- MacOS
posix_memalign
fall-back - MSVC
_aligned_malloc
fall-back
- MacOS
- Add VOLK version in
volk_version.h
(included involk.h
) - Improved CMake code
- Improved code with lots of refactoring and performance tweaks
- Carles Fernandez [email protected]
- Gwenhael Goavec-Merou [email protected]
- Albin Stigo [email protected]
- Johannes Demel [email protected] [email protected]
- Michael Dickens [email protected]
- Valerii Zapodovnikov [email protected]
- Vasil Velichkov [email protected]
- ghostop14 [email protected]
- rear1019 [email protected]
- CMake
- Fix detection of AVX and NEON
- Fix for macOS
- lib/CMakeLists: use asm instead of asm for ARM tests
- lib/CMakeLists: fix detection when compiler support NEON but nor neonv7 nor neonv8
- lib/CMakeLists.txt: use __VOLK_ASM instead of asm
- lib/CMakeLists.txt: let VOLK choose preferred neon version when both are supported
- lib/CMakeLists.txt: simplify neon test support. Unset neon version if not supported
- For attribute, change from clang to "clang but not MSC"
- Readme
- logo: Add logo at top of README.md
- Build dependencies
- python: Drop Python2 support
- python: Reduce six usage
- python: Move to Python3 syntax and modules
- six: Remove build dependency on python six
- Allocation
- alloc: Use C11 aligned_alloc
- alloc: Implement fall backs for C11 aligned_alloc
- alloc: Fix for incomplete MSVC standard compliance
- alloc: update to reflect alloc changes
- Library usage
- Fixup VolkConfigVersion
- add volk_version.h
- Refactoring
- qa_utils.cc: fix always false expression
- volk_prefs.c: check null realloc and use temporary pointer
- volk_profile.cc: double assignment and return 0
- volk_32f_x2_pow_32f.h: do not need to _mm256_setzero_ps()
- volk_8u_conv_k7_r2puppet_8u.h: d_polys[i] is positive
- kernels: change one iteration for's to if's
- kernels: get rid of some assignments
- qa_utils.cc: actually throw something
- qa_utils.cc: fix always true code
- rotator: Refactor AVX kernels
- rotator: Remove unnecessary variable
- kernel: Refactor square_dist_scalar_mult
- square_dist_scalar_mult: Speed-Up AVX, Add unaligned
- square_dist_scalar_mult: refactor AVX2 kernel
- kernel: create AVX2 meta intrinsics
- CI
- appveyor: Test with python 3.4 and 3.8
- appveyor: Add job names
- appveyor: Make ctest more verbose
- Performance
- Improve performance of generic kernels with complex multiply
- square_dist_scalar_mult: Add SSE version
- Adds NEON versions of cos, sin and tan
Hi everyone,
with VOLK 2.2.0, we introduced another AVX rotator bug which is fixed with this release. In the process 2 more bugs were identified and fixed. Further, we saw some documentation improvements.
- Clayton Smith [email protected]
- Michael Dickens [email protected]
- Fix loop bound in AVX rotator
- Fix out-of-bounds read in AVX2 square dist kernel
- Fix length checks in AVX2 index max kernels
- includes: rearrange attributes to simplify macros Whitespace
- kernels: fix usage in header comments \n
Hi everyone!
VOLK 2.3 is out! We want to thank all contributors. This release wouldn't have been possible without them. We saw lots of great improvements.
On GNU Radio 'master' VOLK was finally removed as a submodule.
Currently we see ongoing discussions on how to improve CPU feature detection because VOLK is not as reliable as we'd like it to be in that department. We'd like to benefit from other open source projects and don't want to reinvent the wheel. Thus, one approach would be to include cpu_features
as a submodule.
- Better reproducible builds
- CMake improvements
- ORC is removed from the public interface where it was never supposed to be.
- CMake fixes for better usability
- Updated and new CI chain
- TravisCI moved to new distro and does more tests for newer GCC/Clang
- Github Actions
- Add Action to check proper code formatting.
- Add CI that also runs on MacOS with XCode.
- Enforce C/C++ coding style via clang-format
- Kernel fixes
- Add puppet for
power_spectral_density
kernel - Treat the
mod_range
puppet as a puppet for correct use withvolk_profile
- Fix
index_max
kernels - Fix
rotator
. We hope that we finally found the root cause of the issue.
- Add puppet for
- Kernel optimizations
- Updated log10 calcs to use faster log2 approach
- Optimize
complexmultiplyconjugate
- New kernels
- accurate exp kernel is finally merged after years
- Add 32f_s32f_add_32f kernel to perform vector + scalar float operation
- Bernhard M. Wiedemann [email protected]
- Clayton Smith [email protected]
- Johannes Demel [email protected] [email protected]
- Michael Dickens [email protected]
- Tom Rondeau [email protected]
- Vasil Velichkov [email protected]
- ghostop14 [email protected]
- Reproducible builds
- Drop compile-time CPU detection
- Drop another instance of compile-time CPU detection
- CI updates
- ci: Add Github CI Action
- ci: Remove Ubuntu 16.04 GCC5 test on TravisCI
- TravisCI: Update CI to bionic distro
- TravisCI: Add GCC 8 test
- TravisCI: Structure CI file
- TravisCI: Clean-up .travis.yml
- Enforce C/C++ coding style
- clang-format: Apply clang-format
- clang-format: Update PR with GitHub Action
- clang-format: Rebase onto current master
- Fix compiler warnings
- shadow: rebase kernel fixes
- shadow: rebase volk_profile fix
- CMake
- cmake: Remove the ORC from the VOLK public link interface
- versioning: Remove .Maint from libvolk version
- Fix endif macro name in comment
- Kernels
- volk: accurate exp kernel
- exp: Rename SSE4.1 to SSE2 kernel
- Add 32f_s32f_add_32f kernel
- This kernel adds in vector + scalar functionality
- Fix the broken index max kernels
- Treat the mod_range puppet as such
- Add puppet for power spectral density kernel
- Updated log10 calcs to use faster log2 approach
- fix: Use unaligned load
- divide: Optimize complexmultiplyconjugate
- volk: accurate exp kernel
Hi everyone!
We have another VOLK release! We're happy to announce VOLK v2.4.0! We want to thank all contributors. This release wouldn't have been possible without them.
We introduce cpu_features
as a private submodule in this release because we use it to detect CPU features during runtime now. This should enable more reliable feature detection. Further, it takes away the burden to implement platform specific code. As a result, optimized VOLK kernels build for MacOS and Windows with AppleClang/MSVC out-of-the-box now.
-
Documentation
- Update README to be more verbose and to improve usefulness.
-
Compilers
- MSVC: Handle compiler flags and thus architecture specific kernels correctly. This enables optimized kernels with MSVC builds.
- AppleClang: Treat AppleClang as Clang.
- Paired with the
cpu_features
introduction, this enables us to use architecture specific kernels on a broader set of platforms.
-
CI
- Update to newest version of the Intel SDE
- Test the rotator kernel with a realistic scalar
- Introduce more test cases with newer GCC and newer Clang versions.
-
CMake
- Enable to not install
volk_modtool
. - Remove "find_package_handle_standard_args" warning.
- Enable to not install
-
cpu_features
- Use
cpu_features
v0.6.0 as a private submodule to detect available CPU features. - Fix incorrect feature detection for newer AVX versions.
- Circumvent platform specific feature detection.
- Enable more architecture specific kernels on more platforms.
- Use
-
Kernels
- Disable slow and broken SSE4.1 kernel in
volk_32fc_x2_dot_prod_32fc
- Adjust min/max for
32f_s32f_convert_8i
kernel - Use
INT8_*
instead ofCHAR_*
- Disable slow and broken SSE4.1 kernel in
- Adam Thompson [email protected]
- Andrej Rode [email protected]
- Christoph Mayer [email protected]
- Clayton Smith [email protected]
- Doron Behar [email protected]
- Johannes Demel [email protected], [email protected]
- Martin Kaesberger [email protected]
- Michael Dickens [email protected]
- Ron Economos [email protected]
-
Documentation
- Update README to include ldconfig upon volk build and install completion
- Update README based on review
- readme: Fix wording
- docs: Fix conversion inaccuracy
-
MSVC
- archs: MSVC 2013 and greater don't have a SSE flag
-
CI
- update to newest version of the Intel SDE
- Test the rotator kernel with a realistic scalar
-
CMake
- build: Enable to not install volk_modtool
- cmake: Remove "find_package_handle_standard_args" warning.
- cmake: Ensure that cpu_features is built as a static library.
-
cpu_features
- readme: Add section on supported platforms
- readme: Make supported compiler section more specific
- travis: Add GCC 9 test on focal
- travis: Add tests for clang 8, 9, 10
- travis: Fix incorrect CXX compiler assignment
- cpu_features: Remove unused feature checks
- ci: Update TravisCI for cpu_features
- cpu_features: Fix MSVC build
- pic: Fix BUILD_PIC issue
- ci: Update CI system configuration
- cpu_features: Bump submodule pointer to v0.6.0
- docs: Add hints how to handle required submodules
- cpu_features: Switch to cpu_features
- ci: Update CI system for cpu_features
- cpu_features: Force PIC build flag
- appveyor: Add recursive clone command
- cpu_features: Remove xgetbv checks
- pic: Cache and force BUILD_PIC
- ci: Remove explicit BUILD_PIC from cmake args
- ci: Remove BUILD_PIC from CI cmake args
- cpu_features: Remove commented code
- cpu_features: Assume AppleClang == Clang
- cpu_features: Remove obsolete code in archs.xml
- fix for ARM cross-compiling CI
- ARM CI: remove unneeded environment variables
-
Housekeeping
- structure: Move release scripts to scripts folder
-
Kernels
- emit an emms instruction after using the mmx extension
- volk_32fc_x2_dot_prod_32fc: disable slow & broken SSE4.1 kernel
- fix: Adjust min/max for 32f_s32f_convert_8i kernel
- fix: Use INT8_* instead of CHAR_*
Hi everyone!
We have a new VOLK bugfix release! We are happy to announce VOLK v2.4.1! We want to thank all contributors. This release wouldn't have been possible without them.
Our v2.4.0 release introduced quite a lot of changes under the hood. With this bugfix release, we want to make sure that everything works as expected again.
- A. Maitland Bottoms [email protected]
- Johannes Demel [email protected]
- Michael Dickens [email protected]
- Philip Balister [email protected]
- Ron Economos [email protected]
- Ryan Volz [email protected]
-
Build
- cpu_features CMake option
- Add cpu_features to static library build.
- Use static liborc-0.4 library for static library build.
- cmake: Detect if cpu_features submodule is present.
-
Install
- Check for lib64 versus lib and set LIB_SUFFIX accordingly.
-
CI
- Add CI test for static library build.
-
Releases
- project: Include git submodules (i.e. cpu_features) in release tarball.
- scripts: Add GPG signature to release script
-
Other
- readme: Update TravisCI status badge