rust - https://github.com/rust-lang/rust

Age	Commit message (Collapse)	Author	Lines
2016-12-30	Feature gate: 1.16.0 instead of 1.15.0	est31	-1/+1

2016-12-30	Fix LEB128 to work with the stage1	Simonas Kazlauskas	-2/+2
	Stage 1 can’t really handle negative 128-bit literals, but an equivalent bit-not is fine
2016-12-30	Cleanup FIXMEs	Simonas Kazlauskas	-22/+0

2016-12-30	Feature gate the 128 bit types	Simonas Kazlauskas	-0/+15
	Dangling a carrot in front of a donkey. This commit includes manual merge conflict resolution changes from a rebase by @est31.
2016-12-30	Such large. Very 128. Much bits.	Simonas Kazlauskas	-22/+52
	This commit introduces 128-bit integers. Stage 2 builds and produces a working compiler which understands and supports 128-bit integers throughout. The general strategy used is to have rustc_i128 module which provides aliases for iu128, equal to iu64 in stage9 and iu128 later. Since nowhere in rustc we rely on large numbers being supported, this strategy is good enough to get past the first bootstrap stages to end up with a fully working 128-bit capable compiler. In order for this strategy to work, number of locations had to be changed to use associated max_value/min_value instead of MAX/MIN constants as well as the min_value (or was it max_value?) had to be changed to use xor instead of shift so both 64-bit and 128-bit based consteval works (former not necessarily producing the right results in stage1). This commit includes manual merge conflict resolution changes from a rebase by @est31.
2016-12-29	Remove not(stage0) from deny(warnings)	Alex Crichton	-1/+1
	Historically this was done to accommodate bugs in lints, but there hasn't been a bug in a lint since this feature was added which the warnings affected. Let's completely purge warnings from all our stages by denying warnings in all stages. This will also assist in tracking down `stage0` code to be removed whenever we're updating the bootstrap compiler.
2016-12-29	Rollup merge of #38559 - japaric:ptx2, r=alexcrichton	Alex Crichton	-1/+18
	PTX support, take 2 - You can generate PTX using `--emit=asm` and the right (custom) target. Which then you can run on a NVIDIA GPU. - You can compile `core` to PTX. [Xargo] also works and it can compile some other crates like `collections` (but I doubt all of those make sense on a GPU) [Xargo]: https://github.com/japaric/xargo - You can create "global" functions, which can be "called" by the host, using the `"ptx-kernel"` ABI, e.g. `extern "ptx-kernel" fn kernel() { .. }`. Every other function is a "device" function and can only be called by the GPU. - Intrinsics like `__syncthreads()` and `blockIdx.x` are available as `"platform-intrinsics"`. These intrinsics are not in the `core` crate but any Rust user can create "bindings" to them using an `extern "platform-intrinsics"` block. See example at the end. - Trying to emit PTX with `-g` (debuginfo); you get an LLVM error. But I don't think PTX can contain debuginfo anyway so `-g` should be ignored and a warning should be printed ("`-g` doesn't work with this target" or something). - "Single source" support. You can't write a single source file that contains both host and device code. I think that should be possible to implement that outside the compiler using compiler plugins / build scripts. - The equivalent to CUDA `__shared__` which it's used to declare memory that's shared between the threads of the same block. This could be implemented using attributes: `#[shared] static mut SCRATCH_MEMORY: [f32; 64]` but hasn't been implemented yet. - Built-in targets. This PR doesn't add targets to the compiler just yet but one can create custom targets to be able to emit PTX code (see the example at the end). The idea is to have people experiment with this feature before committing to it (built-in targets are "insta-stable") - All functions must be "inlined". IOW, the `.rlib` must always contain the LLVM bitcode of all the functions of the crate it was produced from. Otherwise, you end with "undefined references" in the final PTX code but you won't get any linker error because no linker is involved. IOW, you'll hit a runtime error when loading the PTX into the GPU. The workaround is to use `#[inline]` on non-generic functions and to never use `#[inline(never)]` but this may not always be possible because e.g. you could be relying on third party code. - Should `--emit=asm` generate a `.ptx` file instead of a `.s` file? TL;DR Use Xargo to turn a crate into a PTX module (a `.s` file). Then pass that PTX module, as a string, to the GPU and run it. The full code is in [this repository]. This section gives an overview of how to run Rust code on a NVIDIA GPU. [this repository]: https://github.com/japaric/cuda - Create a custom target. Here's the 64-bit NVPTX target (NOTE: the comments are not valid because this is supposed to be a JSON file; remove them before you use this file): ``` js // nvptx64-nvidia-cuda.json { "arch": "nvptx64", // matches LLVM "cpu": "sm_20", // "oldest" compute capability supported by LLVM "data-layout": "e-i64:64-v16:16-v32:32-n16:32:64", "llvm-target": "nvptx64-nvidia-cuda", "max-atomic-width": 0, // LLVM errors with any other value :-( "os": "cuda", // matches LLVM "panic-strategy": "abort", "target-endian": "little", "target-pointer-width": "64", "target-vendor": "nvidia", // matches LLVM -- not required } ``` (There's a 32-bit target specification in the linked repository) - Write a kernel ``` rust extern "platform-intrinsic" { fn nvptx_block_dim_x() -> i32; fn nvptx_block_idx_x() -> i32; fn nvptx_thread_idx_x() -> i32; } /// Copies an array of `n` floating point numbers from `src` to `dst` pub unsafe extern "ptx-kernel" fn memcpy(dst: mut f32, src: const f32, n: usize) { let i = (nvptx_block_dim_x() as isize) .wrapping_mul(nvptx_block_idx_x() as isize) .wrapping_add(nvptx_thread_idx_x() as isize); if (i as usize) < n { dst.offset(i) = src.offset(i); } } ``` - Emit PTX code ``` $ xargo rustc --target nvptx64-nvidia-cuda --release -- --emit=asm Compiling core v0.0.0 (file://..) (..) Compiling nvptx-builtins v0.1.0 (https://github.com/japaric/nvptx-builtins) Compiling kernel v0.1.0 $ cat target/nvptx64-nvidia-cuda/release/deps/kernel-.s // // Generated by LLVM NVPTX Back-End // .version 3.2 .target sm_20 .address_size 64 // .globl memcpy .visible .entry memcpy( .param .u64 memcpy_param_0, .param .u64 memcpy_param_1, .param .u64 memcpy_param_2 ) { .reg .pred %p<2>; .reg .s32 %r<5>; .reg .s64 %rd<12>; ld.param.u64 %rd7, [memcpy_param_2]; mov.u32 %r1, %ntid.x; mov.u32 %r2, %ctaid.x; mul.wide.s32 %rd8, %r2, %r1; mov.u32 %r3, %tid.x; cvt.s64.s32 %rd9, %r3; add.s64 %rd10, %rd9, %rd8; setp.ge.u64 %p1, %rd10, %rd7; @%p1 bra LBB0_2; ld.param.u64 %rd3, [memcpy_param_0]; ld.param.u64 %rd4, [memcpy_param_1]; cvta.to.global.u64 %rd5, %rd4; cvta.to.global.u64 %rd6, %rd3; shl.b64 %rd11, %rd10, 2; add.s64 %rd1, %rd6, %rd11; add.s64 %rd2, %rd5, %rd11; ld.global.u32 %r4, [%rd2]; st.global.u32 [%rd1], %r4; LBB0_2: ret; } ``` - Run it on the GPU ``` rust // `kernel.ptx` is the `.s` file we got in the previous step const KERNEL: &'static str = include_str!("kernel.ptx"); driver::initialize()?; let device = Device(0)?; let ctx = device.create_context()?; let module = ctx.load_module(KERNEL)?; let kernel = module.function("memcpy")?; let h_a: Vec<f32> = /* create some random data /; let h_b = vec![0.; N]; let d_a = driver::allocate(bytes)?; let d_b = driver::allocate(bytes)?; // Copy from host to GPU driver::copy(h_a, d_a)?; // Run `memcpy` on the GPU kernel.launch(d_b, d_a, N)?; // Copy from GPU to host driver::copy(d_b, h_b)?; // Verify assert_eq!(h_a, h_b); // `d_a`, `d_b`, `h_a`, `h_b` are dropped/freed here ``` --- cc @alexcrichton @brson @rkruppe > What has changed since #34195? - `core` now can be compiled into PTX. Which makes it very easy to turn `no_std` crates into "kernels" with the help of Xargo. - There's now a way, the `"ptx-kernel"` ABI, to generate "global" functions. The old PR required a manual step (it was hack) to "convert" "device" functions into "global" functions. (Only "global" functions can be launched by the host) - Everything is unstable. There are not "insta stable" built-in targets this time (\). The users have to use a custom target to experiment with this feature. Also, PTX instrinsics, like `__syncthreads` and `blockIdx.x`, are now implemented as `"platform-intrinsics"` so they no longer live in the `core` crate. (\*) I'd actually like to have in-tree targets because that makes this target more discoverable, removes the need to lug around .json files, etc. However, bundling a target with the compiler immediately puts it in the path towards stabilization. Which gives us just two cycles to find and fix any problem with the target specification. Afterwards, it becomes hard to tweak the specification because that could be a breaking change. A possible solution could be "unstable built-in targets". Basically, to use an unstable target, you'll have to also pass `-Z unstable-options` to the compiler. And unstable targets, being unstable, wouldn't be available on stable. > Why should this be merged? - To let people experiment with the feature out of tree. Having easy access to the feature (in every nightly) allows this. I also think that, as it is, it should be possible to start prototyping type-safe single source support using build scripts, macros and/or plugins. - It's a straightforward implementation. No different that adding support for any other architecture.
2016-12-29	Use `?` instead of `try!` macro in `print::pprust`	Esteban Küber	-857/+857

2016-12-28	hir: lower `ImplicitSelf` to resolved `Self` TyQPath's.	Eduard-Mihai Burtescu	-8/+7

2016-12-26	PTX support	Jorge Aparicio	-1/+18
	- `--emit=asm --target=nvptx64-nvidia-cuda` can be used to turn a crate into a PTX module (a `.s` file). - intrinsics like `__syncthreads` and `blockIdx.x` are exposed as `"platform-intrinsics"`. - "cabi" has been implemented for the nvptx and nvptx64 architectures. i.e. `extern "C"` works. - a new ABI, `"ptx-kernel"`. That can be used to generate "global" functions. Example: `extern "ptx-kernel" fn kernel() { .. }`. All other functions are "device" functions.
2016-12-25	Auto merge of #38566 - jseyfried:fix_import_resolution_bug, r=eddyb	bors	-15/+14
	Fix bug in import resolution Fixes #38535 and fixes #38556. r? @nrc
2016-12-24	Auto merge of #38268 - withoutboats:parse_where_higher_rank_hack, r=eddyb	bors	-0/+17
	Prevent where < ident > from parsing. In order to be forward compatible with `where<'a>` syntax for higher rank parameters, prevent potential conflicts with UFCS from parsing correctly for the near term.
2016-12-23	Auto merge of #38533 - jseyfried:legacy_custom_derive_deprecation, r=nrc	bors	-0/+2
	Allow legacy custom derive authors to disable warnings in downstream crates This PR allows legacy custom derive authors to use a pre-deprecated method `registry.register_custom_derive()` instead of `registry.register_syntax_extension()` to avoid downstream deprecation warnings. r? @nrc
2016-12-23	Auto merge of #38232 - jseyfried:refactor_global_paths, r=nrc	bors	-57/+84
	Refactor global paths This PR removes the field `global: bool` from `ast::Path` and `hir::Path`, instead representing a global path `::foo::bar` as `{{root}}::foo::bar`, where `{{root}}` is a virtual keyword `keywords::CrateRoot`. Also, fixes #38016. r? @nrc
2016-12-23	Allow legacy custom derive authors to disable warnings in downstream crates.	Jeffrey Seyfried	-0/+2

2016-12-23	Fix import resolution bug and fold all idents in the AST.	Jeffrey Seyfried	-15/+14

2016-12-22	Remove outdated FIXME comment	Esteban Küber	-4/+0
	Removed FIXME comment referencing #3300.
2016-12-22	Pretty-print `$crate::foo::bar` as `::foo::bar`.	Jeffrey Seyfried	-1/+2

2016-12-22	Refactor how global paths are represented (for both ast and hir).	Jeffrey Seyfried	-57/+83

2016-12-21	Auto merge of #38099 - GuillaumeGomez:cast_suggestions, r=nikomatsakis	bors	-0/+4
	Cast suggestions r? @nikomatsakis
2016-12-20	Rollup merge of #38171 - jseyfried:cleanup, r=nrc	Alex Crichton	-344/+145
	Miscellaneous cleanup/refactoring in `resolve` and `syntax::ext` r? @nrc
2016-12-20	Create check_ref method to allow to check coercion with & types	Guillaume Gomez	-6/+0

2016-12-20	Fix coercion ICE	Guillaume Gomez	-1/+2

2016-12-20	Add safe_suggestion attribute	Guillaume Gomez	-0/+9

2016-12-19	Optimize `ast::PathSegment`.	Jeffrey Seyfried	-184/+66

2016-12-19	Auto merge of #38194 - sgrif:sg-no-span-mangling, r=nrc	bors	-15/+3
	Don't perform span mangling when building field/tup access nodes There are no guarantees that the two spans used to create the new one come from the same place or are even valid. Fixes #36081.
2016-12-18	Refactor out `mark.as_placeholder_id()`.	Jeffrey Seyfried	-5/+9

2016-12-18	Avoid including attributes in bang macro invocations.	Jeffrey Seyfried	-25/+19

2016-12-18	Remove scope placeholders, remove method `add_macro` of `ext::base::Resolver`.	Jeffrey Seyfried	-85/+38

2016-12-18	Remove `MacroDef`'s fields `imported_from` and `allow_internal_unstable`,	Jeffrey Seyfried	-10/+4
	remove `export` argument of `resolver.add_macro()`.
2016-12-18	Add `ident.unhygienize()` and use `Ident` more instead of `Name` in `resolve`.	Jeffrey Seyfried	-4/+8

2016-12-18	Remove some unused functions and fix formatting.	Jeffrey Seyfried	-36/+6

2016-12-17	Auto merge of #38279 - KalitaAlexey:issue-8521, r=jseyfried	bors	-1/+1
	macros: allow a `path` fragment to be parsed as a type parameter bound Allow a `path` fragment to be parsed as a type parameter bound. Fixes #8521.
2016-12-17	Auto merge of #38205 - jseyfried:fix_module_directory_regression, r=eddyb	bors	-23/+30
	macros: fix the expected paths for a non-inline module matched by an `item` fragment Fixes #38190. r? @nrc
2016-12-16	Allow path fragments to be parsed as type parameter bounds in macro expansion	Kalita Alexey	-1/+1

2016-12-12	Auto merge of #38049 - frewsxcv:libunicode, r=alexcrichton	bors	-2/+2
	Rename 'librustc_unicode' crate to 'libstd_unicode'. Fixes https://github.com/rust-lang/rust/issues/26554.
2016-12-09	Fix mistake.	Without Boats	-1/+1

2016-12-09	Improve error message.	Without Boats	-1/+1

2016-12-09	Prevent where < ident > from parsing.	Without Boats	-0/+17
	In order to be forward compatible with `where<'a>` syntax for higher rank parameters, prevent potential conflicts with UFCS from parsing correctly for the near term.
2016-12-07	Auto merge of #38191 - oli-obk:clippy_is_sad, r=eddyb	bors	-104/+117
	annotate stricter lifetimes on LateLintPass methods to allow them to forward to a Visitor this unblocks clippy (rustup blocked after #37918) clippy has lots of lints that internally call an `intravisit::Visitor`, but the current lifetimes on `LateLintPass` methods conflicted with the required lifetimes (there was no connection between the HIR elements and the `TyCtxt`) r? @Manishearth
2016-12-07	macros: fix the expected paths for a non-inline module matched by an `item` ↵	Jeffrey Seyfried	-23/+30
	fragment.
2016-12-06	Don't perform span mangling when building field/tup access nodes	Sean Griffin	-15/+3
	There are no guarantees that the two spans used to create the new one come from the same place or are even valid. Fixes #36081.
2016-12-06	Auto merge of #37973 - vadimcn:dllimport, r=alexcrichton	bors	-7/+0
	Implement RFC 1717 Implement the first two points from #37403. r? @alexcrichton
2016-12-06	annotate stricter lifetimes on LateLintPass methods to allow them to forward ↵	Oliver Schneider	-104/+117
	to a Visitor
2016-12-04	Auto merge of #38087 - jooert:remove_unmarked, r=petrochenkov	bors	-4/+3
	Remove the `unmarked_api` feature Closes #37981.
2016-12-04	Auto merge of #38082 - jseyfried:macro_invocation_paths, r=nrc	bors	-6/+1
	macros: support invocation paths (e.g. `foo::bar!()`) behind `#![feature(use_extern_macros)]` r? @nrc
2016-12-03	Auto merge of #38079 - BurntSushi:attrtarget, r=alexcrichton	bors	-0/+7
	Add new #[target_feature = "..."] attribute. This commit adds a new attribute that instructs the compiler to emit target specific code for a single function. For example, the following function is permitted to use instructions that are part of SSE 4.2: #[target_feature = "+sse4.2"] fn foo() { ... } In particular, use of this attribute does not require setting the -C target-feature or -C target-cpu options on rustc. This attribute does not have any protections built into it. For example, nothing stops one from calling the above `foo` function on hosts without SSE 4.2 support. Doing so may result in a SIGILL. I've also expanded the x86 target feature whitelist.
2016-12-01	Remove the "linked_from" feature.	Vadim Chugunov	-7/+0

2016-12-02	limit the length of types in monomorphization	Ariel Ben-Yehuda	-0/+1
	This adds the new insta-stable `#![type_size_limit]` crate attribute to control the limit, and is obviously a [breaking-change] fixable by that.
2016-11-30	Support paths in macro invocations.	Jeffrey Seyfried	-6/+1